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Problem Solved [Describe the problem solved by this invention] ' \ ' 

This invention enables the creation of fusions between nucleic acids by the insertion of one piece into 
another m order to functionally couple existing functionalities, modulate existing functionalities or create novel 
functionalities. The following list of applications that these engineered nucleic acids or proteins can have is 
illustrative of then- broad potential: (a) regulation of gene transcription, (b) modulation of cell signaling pathways 
(c) targeted drug delivery, (d) drug transport, (e) conditionally active toxic protems. (f) metabolic engineering (g)' 
biosensors and (h) controlled drug release. Domain insertion is the most promising and most general strateg^ for 
engmeenng a molecular switch, as other strategies are limited in the sorts of signals that can be employed or the 
types of protems that can be controlled. However, there are several limitations of existing domain insertion 
strategies that this invention overcomes: (1) they are not combinatorial. A combinatorial approach systematically 
tests a plurality of potential switches to find the optimum switch. (2) The switching behavior achieved by existing 
methods was generally modest (less than 2-fold effect). (3) None of the studies have explored circular permutation 
of either the target or the inserted gene; thus, existing domain insertion strategies can only explore a limited number 
of geometnc configurations between the two domains. 



Novelty [Identify those elements of the invention that are new when compared to the current state of the art] 
To our knowledge, a cu-cularly permuted gene that has been circularly permuted at a specific site and 
inserted into another gene (i.e. by domain insertion at a specific site or by random domain insertion) has never been 
descnbed. To our knowledge, the insertion of a randomly circularly permuted gene into another gene (i e by domain 
insertion at a specific site or by random domain insertion) has never been described. Furthermore, the idea that such 
a process will create a molecular switch does not have precedent and is not obvious. 



ACKNOWLEDGMENT, CERTIFICATION and ASSIGNMENT OF INVENTION 

In order for this Report of Invention to be complete and processed by LTD, it must be signed and dated by: 

(1) the JHU Department Director for each JHU department involved with the development of this 
invention (SECTION A), and, 

(2) ALL Inventors (SECTIONS B and C), including those Inventors not subject to The Johns 
Hopkins University Intellectual Property Policy. Each Inventor must complete only one of 
Sections B or C (See explanations below). 

(3) Please duplicate Sections A, B and/or C as needed for proper completion with ALL 
appropriate signatures. 


SECTION A. JHU DEPARTMENT DIRECTOR'S ACKNOWLEDGEMENT 
I have read and understood this Report of Invention. 


0 

JHU Departnteht Director Signature 




Date 


Typed or Printed Name 


JHU School / Department 


JHU Department Director Signature 




Date 


Typed or Printed Name 


JHU School / Department 


JHU Department Director Signature 




Date 


Typed or Printed Name 


JHU School / Department 



SECTION B. JHU INVENTOR CERTIFICATION and ASSIGNMENT 

This section is to be completed only by those JHU personnel subject to The Johns Hopkins University Intellectual 
Property Policy. Non-JHU Inventors, and HHMI or KKI Inventors at JHU are subject to a separate assignment and 
must complete Section C. JHU Inventors who believe they are not subject to The Johns Hopkins Univereity 
Intellectual Property Policy for the invention described herein must complete Section C. 

I/we, the Inventors, hereby certify that the information set forth in this Report of Invention is true 
and complete to the best of my/our Itnowledge. 

I/we, the Inventors, hereby certify that I/we will promptly advise LTD of any commercial interest 
regarding the invention described herein. 

I/we, the Inventor(s), subject to The Johns Hopkins University Intellectual Property Policy and not 
under an obligation to assign intellectual property rights to another party, hereby affirm that in 
consideration for The Johns Hopkins University's evaluation of commercial potential and a share of 
income which I/we may receive upon commercialization of my/our invention, on the date of my/our 
signature(s) as indicated below do hereby assign and transfer my/our entire right, title and interest 
in and to the invention described herein unto The Johns Hopkins University,- its successors, legal 
representatives and assigns. 


li Ir- 

JHU Inventor Signature 


Typed or Printed Name 


Date 


JHU Inventor Signature 


Typed or Printed Name 


Date 


JHU Inventor Signature 


Typed or Printed Name 


Date 


JHU Inventor Signature 


Typed or Printed Name 


Date 


JHU Inventor Signature 


Typed or Printed Name 


Date 



Ostermeier and Guntas: Molecular Switches Involving Circular Permutation 

1/26 

Molecular Switches and Methods for Making and Using the Same 
Involving the Circular Permutation of DNA 

Abstract 

The invention provides molecular switches which couple external signals to 
functionality and to methods of making and using the same. The switches according to 
the mvention can be used, for example, to regulate gene transcription, target drug 
delivery to specific cells, transport drugs intracellularly, control drug release provide 
conditionally active proteins, perform metabolic engineering, create molecular sensors 
and modulate cell signaling pathways. Libraries comprising the svdtches and expression 
vectors and host cells for expressing the switches are also provided. These libraries 
mvolve the circular permutation of DNA. 

Background 

Molecular Switches. A hallmark of biological systems is the high degree of 
interactions amongst and within their constituent components. One advantage that such 
interactions bring is the establishment of coupling between different ftinctions A protein 
that couples two functions can be described as a molecular switch. For example an 
allosteric enzyme is a switch that couples effector levels (input) to enzyme activity 
(output). In most general terms, a molecular switch couples signals (e.g. Ugand binding 
protein-protem interactions, pH, covalent modification, temperature) to functionality (e £ 
enzymatic activity, binding affinity, fluorescence). Molecular switches can be of an 
onyoff ' nature or such that the signal modulates the function between two different 
levels of activity. The network of such molecular switches establishes the complex 
circuits that control cellular processes. 

The design of molecular switches to modulate or report on biological functions 
has enormous potential for a variety of appUcations including the creation of biosensors 
(Siegel and Isacoff 1997; Baird, Zacharias et al. 1999; Doi and Yanagawa 1999- de 
Lonmier, Smith et al. 2002; Fehr, Frommer et al. 2002) modulators of gene transcription 
and cell signahng pathways (Rivera 1998; Guo, Zhou et al. 2000; Picard 2000) and novel 
biomaterials (Stayton, Shimoboji et al. 1995) (see also Figure 6). Despite their^ast 
potential, molecular switches have not been explored extensively, in part due to the 
paucity of universal strategies for engineering them and the difficulty in engineering a 
switch that responds to a signal unrelated to the proteins' function and activates in the 
presence of the signal (as opposed to, for example, active-site inhibitors). In general 
existing strategies for creating switches are inherently limited in the nature of the 
function that can be controlled, the signal that can be employed, the lack or reversibihty 
the lack of sensitivity, or the requirement for additional cellular components. 

Domain Insertion. Gene fusion technology, the fusion of two or more genes into 
a single gene, has been widely used as a tool in protein engineering, localization and 
punfication. There are two conceptually different methods of making ftisions. The 
simplest method of end-to-end fusions has been used almost exclusively. The second 
method is insertional fusion in which one gene is inserted into the middle of the other 
gene. Insertions result in a continuous domain being split into a discontinuous domain In 
nature, although continuous domains are more common, discontinuous domains are not 
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rare (Russell and Ponting 1998) and a systematic survey of structural domains indicated 
that 28% of structural domains are discontinuous (Jones, Stewart et al. 1998). 

The observation of insertions in naturally occurring proteins suggests that such a 
route can be viable to construct proteins with desired properties and function. 
Furthermore, insertional fusions offer an advantage in numbers, as there are many more 
msertional fusions that can be made between two proteins than end-to-end fusions. 
Whereas it is expected that many of these insertional fusions will not be able to fold 
correctly, studies on insertions of DNA coding for insertions of a few amino acids to 
entire proteins, summarized below, have succeeded in creating active, often fully active 
hybrids. 

Early work on one or two codon insertions indicated that such small insertions 
had a smaller effect on activity than larger insertions (Boeke 1981; Stone, Atkinson et al 
1984; Barany 1985; Barany 1985; Freimuth and Ginsberg 1986; Barany 1987). Work in 
Paul Schimmel's lab in the late 1980's showed that insertions of random sequences into 
E. coli methionyl-tRNA synthetase of up to 14 amino acids could be tolerated in select 
locations (Starzyk, Burbaum et al. 1989). Subsequently other large insertions (Zebala 
and Barany 1991; Ladant, Glaser et al. 1992; Hallet, Sheiratt et al. 1997), even a 
randomized 120 amino acid library insertion (Doi, Itaya et al. 1997) have confirmed this 
plasticity of proteins for insertions. 

The first example of successful insertion of one protein into another was of 
alkaline phosphatase (AP) into the E. coli outer membrane protein MalF, constructed as a 
tool for studying membrane topology (Ehrmann, Boyd et al. 1990). High levels of 
alkaline phosphatase activity were obtained in the fusions despite the fact that alkaline 
phosphatase requires dimerization for activity. Since then, AP has been successfully 
inserted into a number of integral membrane proteins (Bibi and Beja 1994; Lacatena 
Cellini et al. 1994; Pigeon and Silver 1994; Sarsero and Pittard 1995; Pi and Pittard i996- 
Cosgriff and Pittard 1997). Furthermore, other proteins with their N- and C-termini 
proximal, including green fluorescent protein GFP) (Siegel and Isacoff 1997; Biondi 
Baehler et al. 1998; Kratz, Bottcher et al. 1999; Siegel and Isacoff 2000), TEMl p- 
lactamase (Betton, Jacob et al. 1997; Doi and Yanagawa 1999; Collinet, Herve et al 

2000) , thioredoxin (Lu, Murray et al. 1995), dihydrofolate reductase (Collinet Herve et 
al. 2000). FKBP12 (Tucker and Fields 2001), estrogen receptor-a (Tucker and-Relds 

2001) and p-xylanase (Ay, Gotz et al. 1998), have been successfully inserted into other 



Relatively few studies have examined or attempted functional coupling in 
insertional fusions (i.e. creating a molecular switch). Although the goal of the work was 
not to functionally coupled the proteins, studies of the insertion of P-lactamase into MBP 
(Betton, Jacob et al. 1997) and the insertion of P-lactamase or DHFR into yeast 
phosphoglycerate kinase (PGK) (Collinet, Herve et al. 2000) indicate that a minimal level 
of functional coupling of the two activities can exist, even in fusions designed to 
minimize interaction between the two domains. More recently, yeast sensors for ligand 
binding were constructed by the insertion of FKBP12 and estrogen receptor-a ligand- 
binding domain (EPa-LBD) into a rationally chosen site in dihydrofolate reductase 
(DHFR) (Tucker and Fields 2001) in the first designed coupling of growth rate of an 
organism to a small molecule hgand by domain insertion. Yeast expressing the FKBP12- 
DHFR or ERa -DHFR fusion proteins had an approximate two-fold increase in growth 



proteins. 
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rate in the presence of their respective ligands (FK106 and estrogen) when DHFR activity 
limited growth. However, in vitro neither the switches activity nor stability significantly 
changed upon ligand binding and the mechanism for increased growth rate of cells 
bearing these genes is not clear. 

Domain insertion for has been used to couple ligand binding and changes in 
fluorescence. The optical signal transduction of the green fluorescent protein (GFP) has 
made it an attractive target for engineering biosensors by domain insertion (Guerrero and 
Isacoff 2001). GFP has been inserted into voltage-gated channels for potassium (Siegel 
and Isacoff 1997) and sodium (Ataka and Pieribone 2002) to generate sensors in which 
voltage driven rearrangements in the channel alter the brightness of GFP by 5.0% and 
0.5% respectively. Initial attempts at creating such molecular sensors by inserting j3- 
lactamase into GFP (Doi and Yanagawa 1999) were unsuccessful; however, random 
mutagenesis on the fusion was able to create a protein whose fluorescence increased 60% 
upon binding of the p-lactamase inhibitory protein. Insertions of calmodulin (a Ca^" 
binding protein) into GFP resulted in a fusion whose fluorescence changed up to 40% 
upon increases in Ca^* concentration (Baird, Zacharias et al. 1999). In a related strategy 
the gene for a circularly permuted GFP was sandwiched between the gene's for 
calmodulin and it's target peptide M13 to create a series of sensors whose fluorescence 
mtensity increased, decreased or showed a excitation wavelength change upon binding 
Ca2 (Nagai. Sawano et al. 2001). The GFP sensors described in this paragraph were 
developed through trial and error. For example, the site for inserting GFP into the 
sodium channel that showed a 0.5% change was the only one of the eight sites tried that 
showed a response (Ataka and Pieribone 2002). 

Domain insertion has also been used in a strategy called mutually exclusive 
folding (ref) in which either one or the other of the two domains can be folded at one 
time. A signal (such as the presence of the ligand of one the domains) can alter the 
equilibrium between which domain is folded (and active). The ligand can stabilize the 
domain it binds to, thus causing the other domain to unfold. However, such a system is 
an off-switch (the signal turns the function that is to be controlled off) and thus is not as 
generally useful for molecular switch applications. 

Random Domain Insertion, We have previously described a method for 
creating molecular switches for which JHU (JHU ref 1706) has filed a provisional patent 
(60/362,588) and a PCT application. A schematic representation of an exemplary 
implementation of this method is shown in Figure 1. The method creates a library of 
random insertions of one gene (or gene fragment) into another gene (or gene fragment). 
From this library molecular switches can be selected or screened for. 

We demonstrated this method by creating two allosteric enzymes (i.e. enzymatic 
molecular switches) (Guntas and Ostermeier 2004). These allosteric enzymes have been 
created by the covalent linkage of non-interacting, monomeric proteins with the 
prerequisite effector-binding and catalytic functionalities, respectively. The fragment of 
the TEM-1 p-lactamase gene coding for the mature protein lacking its signal sequence 
was randomly inserted into the E. coli maltose binding protein (MBP).gene to create a 
domain insertion library. This library's diversity derived both from the site of insertion 
and from a distribution of tandem duplications or deletions of a portion of the MBP gene 
at the insertion site. From a library of -2x10" in-frame fusions, -800 library members 
conferred a phenotype to E. coli cells that was consistent with the presence of 
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bifunctional fusions that could hydrolyze ampicillin and transport maltose in E. coli. 
Partial screening of this bifunctional sublibrary resulted in the identification of two 
enzymes in which the presence of maltose modulated the rate of nitrocefin hydrolysis. 
For one of these enzymes, the presence of maltose increased k^, by 70% and k^/K^ by 
80% and resulted in kinetic parameters that were almost identical to TEM-1 p-Tactamase. 
Such an increase in activity was only observed with maltooligosaccharides whose 
binding to MBP is known to induce a conformational change from the open to the closed 
form. Modulation of the rate of nitrocefin hydrolysis could be detected at maltose 
concentrations less than 1 j^M. Intrinsic protein fluorescence studies were consistent with 
a conformational change being responsible for the modulation of activity. 

Other strategies for creating molecular switches. There are five predominant 
existing strategies for creating protein molecular switches: (1) domain insertion(discussed 
above), (2) control of oligomerization or proximity using chemical inducers of 
dimerization (CID), (3) chemical rescue, (4) fusion of the target protein to a steroid 
binding domain (SBD) and (5) coupling proteins to nonbiological materials such as 
'smart' polymers (Stayton, Shimoboji et al. 1995; Ding, Fong et al. 2001; Kyriakides, 
Cheung et al. 2002) or metal nanocrystals (Hamad-Schifferli, Schwartz et al. 2002). A 
sixth related strategy, control by regulation of aggregation (Rivera, Wang et al. 2000), is, 
strictly speaking, is a method for controlling a protein's availability and not a soluble ' 
protein's activity. A seventh strategy, introducing proximal cysteine residues into a 
protein and then modulating a protein's activity by controlling whether the cysteine forms 
a disulfide is obviously very limited in the signals that can be employed (ref). 

Control using a chemical inducer n f dimerization ( CID) This strategy utilizes a 
synthetic ligand as the CID that controls the oligomeric or proximity of two proteins 
(Spencer, Wandless et al. 1993), best exemplified by signal transduction and gene 
expression (Rivera 1998). The CIDs are small molecules that have two binding surfaces 
that facilitate die dimerization of domains fused to target proteins. This was first 
developed using the immunosupressant FK506 to facilitate dimerization of target proteins 
fused to the FK506-binding protein, FKBP12 (Spencer, Wandless et al. 1993). Several 
variations on this system as well as a system using the antibiotic coumermycin to 
dimerize proteins fused to B subunit of bacterial DNA gyrase (GyrB) (Farrar, Olson et al. 
2000) have appear since. CIDS have been used to initiate signaling pathways by^ 
dimerizing receptors on the cell surface, to translocate cytosolic proteins to the plasma 
membrane, to import and export proteins from the nucleus, to induce apoptosis and to 
regulate gene transcription (Bishop, Buzko et al. 2000; Farrar, Olson et al. 2000). 
However, CIDs have only been applied to those functions that require changes in the 
oligomeric state or proximity of the two proteins. As described in the hterature, this 
approach can not be readily applied to a single protein. 

Chemical Rescue. Chemical rescue is the restoration of activity to a mutant, 
catalytically defective enzyme by the introduction of a small molecule that has the ' 
requisite properties of the mutated residues. Since first described for subtilisin (Carter 
and Wells 1987), chemical rescue has been demonstrated for a number of different 
mutated protein-small molecule pairs (Williams, Wang et al. 2000). The vast majorities 
of these rescues required > 5 mM to showed detectable rescue and the maximum fold 
improvement in activity of the mutant was generally less than 100-fold and required >100 
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mM concentrations of the rescuing molecule. Chemical rescue has only recently been 
applied as a strategy for control, in this case of dimerization (Guo, Zhou et al. 2000). 

Fusion to a steroid bin ding domain. The protein to be controlled is fused end-to- 
end to a SBD (Picard 2000). In the absence of the steroid that binds to the SBD, it is 
beUeved that a Hsp90-SBD complex sterically interferes with the activity of the protein 
fused to the SBD. The disassembly of the complex upon steroid binding restores activity 
to the protein. This strategy has been successfully applied, principally to transcription 
factors and kinases (Picard 2000). Artificial transcription factors using this strategy have 
been developed (called GeneSwitch ) and have promise for tissue-specific gene 
expression in transgenic animals and human gene therapy (Burcin, BW et al. 1998; 
Burcin, Schiedner et al. 1999). 

Coupling to non-biological materials The protein to be controlled is coupled to a 
non-biological material that responds to an external and thereby affects the protein 
coupled to it. 'Smart' polymers that change their conformation upon a change in pH or 
temperature have been conjugated to proteins near ligand-binding sites to create switches 
that sterically block access to the binding site at, for example higher temperature but not 
at lower temperatures (Stayton, Shimoboji et al. 1995; Ding, Fong et al. 2001). Inductive 
coupling of a magnetic field to metal nanocrystals attached to biomolecules resulting in 
an increase in local temperature thereby inducing denaturation, has so far only been 
applied to DNA (Hamad-Schifferli, Schwartz et al. 2002) but presumably will also work 
with proteins. 

Limitations of existing approaches. Domain insertion is the most promising and 
most general strategy for engineering a molecular switch, as the other strategies are 
limited in the sorts of signals that can be employed or the types of proteins that can be 
controlled. CIDs have only been applied to those functions that require changes in the 
oligomeric state or proximity of the two proteins and thus cannot be used to control a 
single protein. The serious limitation of the chemical rescue approach is the inabihty to 
apply the method to any signal and the lack of sensitivity (high concentrations of the 
signal are required for a small change in activity). There are several factors that limit the 
SBD strategy as a general method for controlling any protein. End-to-end fusion appears 
applicable only to fusions with SBDs as no other similar system has appeared in the 
thirteen years since its introduction (Picard, Salser et al. 1988; Filers, Picard et-^. 1989). 
Thus, the potential for the creation of molecular switches triggered by non-steroid signals 
by this method appears very limited. Also, a steroid is used to impart control, but 
steroids have pleioUropic effects and thus are not likely to be useful for engineering 
therapeutic molecules. Finally, the system requires a fourth component, the Hsp90 
complex and apparently has low induction ratios (about ten fold) (Spencer 1996). 
Coupling to non-biological systems are limited in the signals that can be employed. 
Primarily the only signals that have been employed are pH and temperature and it is 
difficult to image that this strategy could be used for signals that are proteins or 
metabolites. 

There are several limitations of existing domain insertion strategies: (1) With the 
exception of our work, all domain insertion studies for creating molecular switches have 
examined a very small number of possible insertional fiisions between the two domains 
(i.e. the insertion locations were rationally chosen). Just as combinatorial approaches 
have proven to be invaluable for improving protein function by directed evolution 
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(Arnold 2001) (e.g. random mutagenesis and DNA shuffling), the development of 
combinatorial methods for domain insertion will likely have a similar effect on the 
development of molecular switches. (2) The switching behavior achieved my existing 
methods was generally modest (less than 2-fold effect). A combinatorial approach 
systematically tests a plurality of potential switches to find the optimum switch. (3) None 
of the studies have explored cu-cular permutation of either the target or the inserted gene; 
thus, existing domain insertion strategies can only explore a limited number of geometric 
configurations between the two domains. 

Circular permutation. A circularly permuted protein has its original N- and C- 
tennini fused and new N- and C- termini created by a break elsewhere in the sequence. 
The first in vitro construction of a circular permuted protein was carried out on bovine 
pancreatic trypsin inhibitor by chemical means (Goldenberg and Creighton 1983). Since 
then, a number of circular permuted proteins have been constructed (Heinemann and 
Hahn 1995), primarily by genetic methods, including TEM-1 p-lactamase (Osuna, P6rez- 
Blancas et al. 2002) used in this study. These studies have shown that circular permuted 
proteins very often fold up into stable, active proteins. Comparisons of primary and 
tertiary structures within several protein families have led to the conclusion that curcular 
permutation occurs in natural protein sequences. 

Random circular permutation. A genetic method for random circular 
permutation of any gene was first described by Graf and Schachmann (Graf and 
Schachman 1996). The method is comprised of the following steps: (i) isolation of a 
linear fragment of double stranded DNA (of the gene to be randomly circularly 
permuted) with flanking compatible ends, (ii) cyclization of this DNA fragment by 
ligation under dilute conditions, (iii) random linearization of the cyclized gene using 
DNase I digestion in the presence of Mn^* at dilute concentrations of the enzyme such 
that the DNase I, on average, makes one double strand break, (iv) repair of nicks and 
gaps using a DNA polymerase and a DNA ligase, and (v) ligation of the fragment into a 
desired vector by blunt end ligation to create the plasmid library of randomly circularly 
pennuted genes. The procedure has been used to systematically identify permissive sites 
for circular permutation in aspartate transcarbamoylase (Graf and Schachman 1996), 
DsbA (Hennecke, Sebbel et al. 1999) and GFP (Baird, Zacharias et al. 1999) and DHFR 
(Iwakura, Nakamura et al. 21000) and in most case a variety of permissible sites-were 
identified. The original random circular permutation protocol (ref) as published works 
poorly, if at all and likely had errors in it (Graf and Schachman 1996). The methodology 
for randomly circularly permuting DNA was improved upon in a study involving the 
random localization of a restriction enzyme site in a fixed length of DNA (Ostermeier 
and Benkovic 2001). 

Novelty. To our knowledge, a circularly permuted gene that has been circularly 
pennuted at a specific site and inserted into another gene (i.e. by domain insertion at a 
specific site or by random domain insertion) has never been described, though such a 
construct has been proposed (but not tested) as having potential for creating GFP sensors 
(Baird, Zacharias et al. 1999). To our knowledge, die insertion of a randomly circularly 
permuted gene into another gene (i.e. by domain insertion at a specific site or by random 
domain insertion) has never been described. Furthermore, the idea that such a process 
will create a molecular switch does not have precedent and is not obvious. 
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Description of the Invention 

This invention combines circular permutation and domain insertion in a novel 
fashion to create molecular switches. The invention is reduced to practice using TEM-1 
P-Iactamase (BLA) and E. coli maltose binding protein (MBP) as model proteins. These 
are the same proteins used to demonstrate how random domain insertion alone can create 
switches (our previous invention). Although the random domain insertion method alone 
has been able to create a BLA-MBP molecular switch, to date the magnitude of the 
switching achieved has been modest (< 2-fold) and only "on-switches" were found (i.e. 
the addition of signal resulted in an increase in function). The switching obtained using 
circular permutation and domain insertion has resulted in both "on" and "off" switches 
and magnitude of the switching has been much higher (in one case, at least 32-fold). 
Although demonstrated with two particular proteins this invention applies to functionally 
coupling any two proteins. 

Key to this invention is the circular permutation of one of the genes (in this case 
the insert gene, although circular permutation of the acceptor sequence is also possible). 
A p-lactamase that has been circularly permuted and inserted into MBP will have a 
different covalent linkage and a different spatial orientation to the MBP compared to the 
original insertion using the wildtype N- and C-termini of BLA. This potentially can have 
dramatic effects on the switching behavior. For example, the switching behavior can 
change due to (1) decreasing the distance from the active site of p-lactamase to residues 
allosterically linked to maltose binding to MBP, (2) exploring new paths by which the 
effects of maltose binding can propagate to the active site of P-lactamase and (3) 
changing the stability of the BLA domain making it more susceptible to changes in 
conformation of the MBP domain. For example, the switches identified by domain 
insertion of a non-circularly permuted BLA (Guntas and Ostermeier 2004) were linked 
through a very stable portion of the p-lactamase (Luque and Freire 2000). Circularly 
permuting p-lactamase will allow linkage through less stable regions that are postulated 
to be more sensitive to the effects of maltose binding. 

Two strategies for creating molecular switches involving random circular 
permutation of the insert gene have been demonstrated (Figure 2) using MBP and BLA as 
an example. In the first, (called "Random Circular Permutation of Insert and Domain 
Insertion at a Specific Site") the BLA gene is circularly permuted and mserted mto a 
specific site in the MBP. This site could be a site previously shown to be useful for 
creating molecular switches (as is demonstrated here) or a site that is predicted, by 
computational methods or other means, to be useful in creating a molecular. In the second 
method (called "Random Circular Permutation of Insert and Random Domain Insertion") 
the BLA gene is randomly circularly permuted and randomly inserted into a plasmid 
containing the MBP gene. The use of BLA and MBP are only examples and potentially 
any two proteins can be fiinctionally coupled in this manner. 

Linkers for Circular Permutation. In order to circularly permute a gene it is 
usually necessary to include DNA that codes for a linker to link the original N- and C- 
termini. We chose to test two different linkers. In the first (the "DKS linker") the P- 
lactamase was randomly circularly permuted by fusing the 5'- and 3'- ends with a DNA 
sequence coding for the tripeptide linker DKS previously found in a combinatorial Ubrary 
of linkers to be most conducive for circularly permuting p-lactamase when the new N- 
and C-teimini were located at a specific location (Osuna, P^rez-Blancas et al. 2002). In 
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the second (the "GSGGG linker") the p-lactamase was randomly circularly permuted by 
fusing the 5'- and 3'- ends with a DNA sequence coding for the flexible pentapeptide 
linker GSGGG. 

Preparation of BLA Insert DNA. The p-lactamase gene fragment bla [24-286] 
(codes for amino acids 24-286) was amplified by PGR from pBR322 such that it was 
flanked by Earl or BamHl sites and sequences coding for the linkers described above and 
cloned into pGem T-vector (Promega) to create pBLA-CP(DKS) and pBLA- 
CP(GSGGG), respectively (Figures 3 and 4). DNA coding for amino acids 1-23 were not 
desired because it codes for the signal sequence that targets 6-lactamase to the periplasm. 
This sequence gets cleaved upon entering the periplasm and is not part of the mature, 
active B-lactamase. 

One hundred and thirty micrograms of pBLA-CP(GSGGG) was digested with 
2000 units of BamHl and 140 micrograms of pBLA-CP(DKS) was digested with 600 
units of Earl in the buffers and conditions recommended by the manufacturer of the 
restriction enzyme. The fragment containing the BLA gene was purified by agarose gel 
electrophoresis using the QIAquick gel purification kit. This DNA was treated with T4 
DNA Ugase under dilute concentrations to cyclize the DNA (18 hours at 16 "C with 600 
Weiss units of T4 DNA Ligase in the presence of 50 mM Tris-HCl (pH 7.5), 10 mM 
MgCl2, 10 mM dithiothreitol, 1 mM ATP, 25 ug/ml BSA in a total volume of 5.1 ml). 
The ligation reaction was stopped by incubation at 65 °C for 20 minutes. The DNA was 
concentrated by vacufuge and desalted using the QIAquick PGR purification kit. The 
cu-cular fragment was purified by agarose gel electrophoresis using the QIAquick gel 
purification kit. 

The conditions for DNasel digestion were determined experimentally by adding 
different amounts of DNasel and analyzing the digested products by agarose gel 
electrophoresis. The digestion conditions were chosen such that a significant fraction of 
DNA was undigested in order maximize the amount of linear DNA that only had one 
double strand break. In general, approximately 1 milliunit of DNasel per microgram of 
DNA (at a concentration 10 micrograms/ml) of for an 8 minute digestion at 22 °C is close 
to being optimum. Sometimes more or less DNasel was requued and thus it is 
recommended that for each library constructed the correct amount of DNasel be 
detennined experimentally by test digestions. The following conditions are given as a 
representative example. Six micrograms of circular DNA was digested with 6 milliunits 
of DNase I (Roche) for 8 minutes at 22 "C in the presence of 50 mM TrisHCl (pH 7.4), 1 
mM MnClj and 50 micrograms/ml BSA in 0.6 ml reaction volume. The reaction was 
stopped by adding EDTA to a concentration of 5 mM. The DNA was desalted using the 
QIAquick PGR purification kit and repaired by 6 units of T4 DNA polymerase and 6 
Weiss Units of T4 DNA ligase at 12 "C for 15 minutes in the presence of 100 micromolar 
dNTP, 50 mM Tris-HCl (pH 7.5), 10 mM MgGla, 10 mM dithiothreitol. 1 mM ATP and 
25 ug/ml BSA. The repaired, linear DNA was purified by agarose gel electrophoresis 
using the QIAquick gel purification kit. This is the circularly permuted gene that is ready 
for insert into another plasmid. 

Preparation of target DNA for random domain insertion libraries. Forty ^g 
of pDIM-C8-Mal was digested with DNasel (0,01 units) for 8 minutes at 22°G in the 
presence of 50 mM Tris-HCl. pH 7.4, 10 mM MnCl^ and 50 ug/ml BSA in a total volume 
of 1 ml. The reaction was quenched by the addition of EDTA to a concentration of 5 mM 
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and the solution was desalted using four Qiaquick PGR purification columns into 200 jxl 
elution buffer which was subsequently concentrated by vacufuge. Nicks and gaps were 
repaired by incubating at 12°C for 1 hour in a total volume of 120 (xl in the presence of 
T4 DNA polymerase (15 units) and T4 DNA ligase (12 Weiss units) in the presence of 50 
mM Tris-HCl, pH 7.5, 10 mM MgCl^. 10 mM DTT. 1 mM ATP, 25 jAg/ml BSA and 125 
^AM dNTPs. The reaction was stopped by incubating at 80°C for 10 minutes. Sodium 
chloride was added to 100 mM and the DNA was dephosphorylated by adding alkaline 
phosphatase (60 units) and incubating at 3TC for 1 hour. The DNA was desalted as 
before and the linear DNA (corresponding to the randomly linearized pDIM-C8-Mal) 
was isolated from circular forms of the plasmid by agarose gel electrophoresis using the 
Qiaquick gel purification kit. 

Preparation of target DNA for site-specific insertion libraries. Plasmid 
pDIM-C8-Mal was modified using overlap extension (Horton, Hunt et al. 1989) to be 
suitable for insertion of the circularly permuted BLA at two specific sites- (a) between 
MB? [1-165] and MBP [164-370] and (b) at the C-terminus of MBP. The plasmids were 
modified in analogous ways, the modifications for insertion between MBP [1-165] and 
MBP [164-370] to create plasmid pDIMC8-MBP(164-165) is described as an example 
(Figure 5). Two inverted Sapl sites were inserted between DNA coding for MBP [1-165] 
and MBP [164-370] in such a manner that digestion with Sapl and subsequent filling in 
of the resulting overhangs using Klenow polymerase in the presence of dNTPs results in 
a perfectly blunt MBP [ 1 -165] on one side and a perfecUy blunt MBP [ 164-370] on the 
other. This is achieved by virtue of the fact that Sapl is- a type IIS restriction enzyme that 
cuts outside of its recognition sequence. Other type HS restriction enzymes could have 
been used. Non-type IIS restriction enzymes could also be used if it is acceptable to have 
their recognition site as part of the gene fragment that is being inserted into 

^ Three micrograms of pDIMC8-MBP(164-165) was digested with 6 units oiSapl 
at 37 °C in the presence of 50 mM potassium acetate, 20 mM Tris-acetate 10 mM 
magnesium acetate, 1 mM dithiothreitol (pH 7.9), 100 ug/ml BSA for 2 5 hours The 
DNA was desalted using the QIAquick PGR purification kit and repaired with 5 units of 
Klenow at 25 °C for 20 minutes in the presence of 33 micromolar dNTPs 1 GO mM NaCl 
50 mM Tris-HGl, 10 mM MgGk and 1 mM dithiothreitol (pH 7.9). The enzyme was heat 
inactivated by incubation at 75 °C for 20 minutes. Sodium chloride was added WlOO 
mM and ten units of Calf Intestinal Phosphatase was added and the solution was 
incubated for 1 hour at 37 °G. Dephosphoiylation was performed to prevent 
recircularization of the vector without receiving an insert in the subsequent ligation step 
The vector DNA was purified by agarose gel electrophoresis using the QIAquick gel 
purification kit. ^ 
Ligation of inserts into target DNA. Insert DNA (85 ng) comprising the 
circularly permuted BLA was ligated to the prepared target DNA (100 ng) at 22°C 
overnight in the presence of T4 DNA ligase (30 Weiss units) and the Ugase buffer 
provided by the manufacturer in a total volume of 13 ^l. After ethanol precipitation, 
10% of the ligase-treated DNA was electroporated into 50 \x\ Electromax DH5a-E 
electrocompetent cells. Transformed cells were plated on large (248 mm x 248 mm) LB 
agar plate supplemented with 50 pig/ml chloramphenicol (Gm). The naive domain 
insertion library was recovered from the large plate (Ostermeier, Nixon et al. 1999) and 
stored in frozen aliquots. 
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Screening for allosteric enzymes. The libraries were diluted from frozen aliquots 
and plated on LB plates containing different concentrations of ampicillin (Tables 1 and 
2). A number of colonies were picked (Table 4) and grown in LB overnight in 96 well 
plates (0.5 ml/well) in the presence of 1 mM IPTG and 50 ^ig/ml Cm. Next, 50 ^il of 
PopCulture (Novagen) and 2.5 unit of benzonase nuclease was added to each well and 
incubated for 15 minutes at room temperature to lyse the cells. The cells debris and any 
unlysed cells were pelleted by centrifugation and supernatant was recovered. In 96-well 
format, 60 of lysate was assayed for hydrolysis of nitrocefm (50 jaM) by monitoring 
the increase in absorbance at 490 nm in 100 mM sodium phosphate buffer, pH 7.0, both 
with and without 5 mM maltose. Any lysate in which there was a difference in rate of 
more than 2-fold (between with and without maltose) was selected for retesting and 
fiarther investigation. 

Confirmation and identification of positives. Library members identified as 
having more than 200% switching activity in the 96-well plate screen were grown 24-48 
hours in 100 ml LB media in 500 ml shake flasks at 22°C without IPTG. The cells were 
pelleted and resuspended in 8 ml assay buffer (100 mM sodium phosphate buffer, pH 7.0) 
and lysed by French press. The soluble fraction of this lysate was assayed for hydrolysis 
of nitrocefm (50 \xM) at 22 °C as previously described (Guntas and Ostermeier 2004) 
both with and without 5 mM maltose. Initial rates were determined from absorbance at 
486 nm monitored as a function of time. The enzyme was incubated at the assay 
temperature in the absence or presence of 5 mM maltose for four minutes prior to 
performing the assay. All assays contained 100 mM sodium phosphate buffer, pH 7.0. 
Library members for which there was a difference in the initial rate of more than about 2- 
fold were sequenced (Table 4). Switches RG-5-169 and RG-200-13 were also assayed in 
the presence of 5 mM sucrose or 5 mM glucose. Neither sugar affected the velocity of 
nitrocefm hydrolysis indicating that the switching effect was specific for maltose, a 
ligand to which MB? binds. 

Analysis of Purified Switch RG-200-13. A 6xHis tag was added to the C- 
terminus of RG-200-13 and the fusion was purified as previously described for another 
switch (Guntas and Ostermeier 2004). The protein was purified to approximately 60% 
purity. The kinetic constants and binding constants were determined from Eadie-Hofstee 
plots and Eadie plot equivalents, respectively, usmg a spectrophotometric assay~for 
nitrocefin hydrolysis. Initial rates for nitrocefm hydrolysis were determined from 
absorbance at 486 nm monitored as a function of time. The enzyme was incubated at the 
assay temperature in the absence or presence of saccharide for four minutes prior to 
performing the assay. All assays contained 100 mM sodium phosphate buffer, pH 7.0. 
The dissociation constant for maltose was determined using change in velocity of 
nitrocefin hydrolysis as a signal. 

Only sugars known to bind to MBP had an effect on nitrocefm hydrolysis (Table 
5). Those sugars that produce a large conformational change upon binding MBP 
(Quiocho, Spuriino et al. 1997) (maltose and maltotriose) produced the largest change in 
the velocity of nitrocefin hydrolysis. Beta-cyclodextrin, which produces a small 
conformational change upon binding MBP (Evenas, Tugarinov et al. 2001), has a small 
effect. The effect of maltotetraitol is intermediate, consistent with the fact that 
maltotetraitol-binding to MBP results in a mixture of open and closed structures (Duan 
Hall et al. 2001). 
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The kinetic parameters of RG-200-13 are reported in Table 6. The kinetic 
parameters of RG-200-13 at 22°C in the presence of maltose ()tei,= ~520 s '; = -85 
fxM) are very similar to previously reported values for TEM-1 P-lactamase at 30 °C (/t„,= 
930 s"'; = 52 [xM) (Raquet, Lamotte-Brasseur et al. 1994) indicating that RG-200-13 is 
essentially a fully functional TEM-1 ^-lactamase in the presence of maltose. The k^/K^ 
in the presence of 5 mM maltose is approximately 25-fold higher than in the absence of" 
maltose. The for maltose binding to RG-200-13 at 22°C was ~5 fiM, similar to the 
previously reported for maltose binding to MBP (1-1.5 ^M) (Schwartz, Kellermann et al 
1976). 

The effect of 5 mM maltose on other sustrates of BLA is shown in Table 7. 
Maltose binding had the largest effect on cephalothin (of the substrates tested), with the 
velocity of cephalothin hydrolysis being 32-fold higher in the presence of maltose than in 
its absence. Based on the effects on other substrates, the actual switching effect on A:,,, 
/K^ for cephalothin is likely to be much higher than 32-fold. 

The fact that the magnitude of the switching effect of RG-200-13 is substrate 
identity and concentration dependent strongly argues that maltose is converting the 
protein from a less active to a more active conformation. The alternative explanation that 
maltose affects the equilibrium between unfolded (inactive) and folded (active) forms of 
the protein would result in a switching effect that was independent of the substrate being 
tested (and independent of substrate concentration), which is not the case. 

Applications of the molecular switches obtained by the invention. Protein 
molecular switches are proteins whose activity can be modulated through a signal such as 
the binding of a small molecule, the interaction with another protein or the sensing of 
some other signal (e.g. a change in pH). In other words, a molecular switch functionally 
couples an external signal to functionality. Protein molecular switches engineered by the 
methods of this invention have a wide variety of potential applications (Fig 6) in, for 
example, (a) the regulation of gene transcription, (b) the modulation of cell signaling 
pathways, (c) targeted drug delivery, (d) dmg transport, (e) the creation of conditionally 
active toxic proteins, (f) metabolic engineering, and (g) tfie creation of novel biosensors. 

The BLA-MBP switches have potential as in vitro or in vivo molecular sensors. 
BLA activity can be measure in vivo or in vitro using specially designed fluorescent 
substrate (Zlokamik, Negulescu et al. 1998; Gao, Xing et al. 2003). MBP is a member of 
a class of proteins called periplasmic binding proteins that have been shown to be 
amenable to computational design to alter the proteins so that they bind new ligands 
(Looger, Dwyer et al. 2003). Thus, the BLA-MBP switches potentially can be used to 
detect different ligands in vivo as enzymatic activity would be proportional to ligand 
concentration. 
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Table 1 Library statistics 

Insertion site Linker in Library Number of Number of Number Increase in 

inMBP BLA size (Number library colonies of unique velocity (of 

of members screened for switches nitrocefm 

transformants that can switching found hydrolysis in 

with BLA grow on 50 (see Table 3) with a 2- presence of 

insert). /^g/mlAMP fold maltose) of 

. (see Table 2) effect* best switch 

164-165 DKS 0.44x10* 515 848 2 +97% 

GSGGG 1.05x10* 361 1248 1 -250% 
C-terminus DKS 1.03x10* 2414 576 0 

GSGGG 0.30x10* 1615 1920 1-4 +234% 
random DKS 0.41x10* 191 384 0 
GSGGG 1.20x10* 1156 3312 5 +1650% 



* > 2-fold change in velocity of nitrocefm hydrolysis in the presence of 5 mM maltose. 
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Table 2 Number of Library members that could grow on plates with ampicillin 
(with or without maltose) ^ 



Ampicillin 


Maltose? 


T1 64-1 65 


T1 64-1 65 


EE 


EE 


Random 


Random 


(Mg/ml) 


(5 mM) 


DKS 


GSGGG 


DKS 


GSGGG 


DKS 


GSGGG 


5 


no 


734 


878 


7052 


3510 


nd 


2458 


50 


no 


394 


294 


1747 


1159 


nd 


783 


200 


no 


220 


nd 


1080 


298 


nd 


nd 


1000 


no 


nd 


74 


nd 


nd 


nd 


60 


5 


yes 


1098 


761 


8354 


4056 


nd 


1969 


50 


yes 


515 


361 


2414 


1615 


191 


1156 


200 


yes 


182 


240 


1525 


630 


nd 


272 


1000 


yes 


nd 


88 


nd 


nd 


nd 


34 



EE = end-to-end (insertion at C-terminus); nd = not determined 



Table 3 Number of library members screened (picked from plates with indicated 
ampicillin and maltose levels) 



Ampicillin 


Maltose? 


T1 64-165 


1164-165 


EE 


EE 


Random 


Random 


(H9/ml) 


(5 mM) 


DKS 


GSGGG 


DKS 


GSGGG 


DKS 


GSGGG 


5 


no 




96 




288 




96 


50 


no 














200 


no 












480 


1000 


no 














5 


yes 


96 


192 




864 




768 


50 


yes 


672 


576 


576 


768 


384 


960 


200 


yes 


80 


384 








1008 


1000 


yes 















EE = end-to-end (insertion at C-temiinus) 



Ostermeier and Guntas: Molecular Switches Involving Circular Permutation 

18/26 



Table 4. Selected BLA-MBP Molecular Switches 



Switch Sequence Switching 

— effect * 

IFG-5-277 MBP[ 1 - 1 65]-BLA[2 1 8-286]-GSGGG-BLA[24-2 1 5]-MBP[ 1 64-370] 1250% 

IFD-5-7 MBP[1-165]-BLA[1 10-286]-DKS-BLA[24-107]-MBP[ 164-370] +96% 

IFD-5-15 MBP[l-165]-BLA[168-286]-DKS-BLA[24-170]-MBP[164-370] +970/^ 

EEG-50-530 MBP[l-370]-BLA[l 14-286]-GSGGG-BLA[24-l 12]-GSQQH +228% 

EEG-50-251 MBP[l-370]-BLA[H4-286]-GSGGG-BLA[24-114]-K +234% 

RG-200-151 MBP[1-34I]-A-BLA[1 1 l-286]-GSGGG-BLA[24-l 15]-MBP[347-370] +350% 

RG-200-604 MBP[l-331]-BLA[113-286]-GSGGG-BLA[24-l 12]-L-MBP[333-370] +390% 

RG-5-169 MBP[l-338]-BLA[34-286]-GSGGG-BLA[24-29]-MBP[337-370] +855% 

RG-200-13 MBP[l-316]-BLA[227-286]-GSGGG-BLA[24-226]-S-MBP[319-370] +16 50% 
* Percent change in velocity of nitrocefm hydrolysis (50 nitrocefm) in the presence 
of 5 mM maltose in 100 mM sodium phosphate buffer, pH 7.0. 



Table 5. Sugar dependence of switching effect of RG-200-13*. 



Sugar 


Binds to MBP? 


Change in velocity of nitrocefm hydrolysis in 
presence of sugar 


Sucrose 


No 


-5% 


Lactose 


No 


-4% 


Galactose 


No 


-3% 


Maltose 


Yes 


+1800% 


Maltotriose 


Yes 


+1700% 


Maltotetraitol 


Yes 


+400% 


p-cyclodextrin 


Yes 


+150% 



*50 \iM nitrocefm, 100 mM sodium phosphate buffer, pH 7.0, 22°C, 5 mM sugar except 
for p-cyclodextrin (3mM). 
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Table 6 Kinetic parameters of nitrocefm hydrolysis of RG-200-13 molecular switch. 



Substrate 


ftcat(S-') 


Krn (UM) 


Ratio" 


No 
maltose 


5mM 
maltose 


Ratio" 


No 
maltose 


5mM 
maltose 


Ratio" 


nitrocefin 


-80 


-520 1 -6.5 


-325 


-85 


-0.26 


-25 



"(with maltose)/(without maltose). Conditions: 100 mM sodium phosphate buffer, pH 7 0 



Table 7. Effect of maltose on other substrates of switch RG-200-13 



Substrate 


Substrate 
concentration 


JTm for TEM-1 p- 
lactamase" 


Approximate fold increase 

in velocity of nitrocefin 
hydrolysis in the presence of 
5 mM maltose 


cephalothin 


250 nM 


246 ^M 


32 


ampicillin 


100 JAM 


32fxM 


26 




500 nM 




10 


benzylpenicillin 


100 (xM 


19nM 


17 




500 \xM 




7 


carbenicillin 


1 mM 


? 


4 


oxacillin 


1 mM 


3fiM 


5 



Conditions: 100 mM sodium phosphate buffer, pH 7.0, 22°C. "(Raquet, Lamotte-Brasseur 
et al. 1994) 
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Cm" 



DDNasd (dilute) 

2) Isolate DNA w/ single ds break 

3) T4 DNA polynierase+ dNTPs 
. andT4DNAIigase 

Y4) Alkaline phosphatase 




\)Ear\ 

2) Isolate bla 
fragment 

3) Klenow+dNTPs 



oooooo 

Figure 1 Random Domain Insertion. 
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pBLA^CPCOKS) / 
p8LA-CP(GSGGG) V J 



tacP/0 
IpOIMCfrMBPj 



Cm" 

I UDNasd (dilute) 

2) lsolateONAw/single<lsbreak 

3) T4 DNA polymef ase •<- dNIPs 
I andT4 0NAIigase 

y4)Alkaline phosphatase 



^ Digestionwith Restridton Enzyme / 
— ► 

^ Ligation (dilute ONAconenetration) 

0 



1) 0Nasel(d8ute) 

2) Isolate DNA w/ single ds brealc 

3) T4 ONApolymerase+dNTPs 

andT4DNAIigase 



tacP/O 



p(3IMC8-MBP(I64-165) 



DSapi 

2) Klenow-t-dNTPs 

3) Aikalinephosphatase 



OOOOO 00000 



Hon of Insett and Random Domain In senior 



Random Grojlar Permutation of Insert and Domain Insertion at SpedficSile 



Figure 2 Creation of domain insertion libraries involving the random circular 
permutation of DNA. 
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o; 



Griactamase 



CGAGMGAGTG^GTCTTTGCGACCAC CaCTMTrCQTAAC<X:TQTTCTCQCT ^CTTCTC ^ 



. 24. 2S. 26. 27. 28. 29. 
' -CACCCAOAAACGCTCGTO-- 
' - GGTCTTTGCGACCAC - - 



- -GACTAATTCGTAACCCTQTTCTCGGTG 



.282 .284 286. D K S 24 25 26 27 28 29 

- CTGATTAAGCATTGGGACAAGAGCCRCCCAGAjJkCGCTGGTG 

- GACTAATTCGTAACCCTCTTCTCOOTCSGGTCTTTGCGACCAC I ~ ~ 



Figure 3 Creation of a cyclized beta-lactamase gene with a DKS linker. 
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pBLA-CP(GSGGG) 



O" 



^lactamase 
BamHI 



&-iactamase a.a. 

. 24. 25. 26. 27. 28. 29 
3CGIGTGGCCACCCAGAAACGCTC3GTG- ■ 
^ CCTAG^ CGCCACCGGTGGQTCTTrQCGftCCRC - - 

BamHI 




GCCGCCACCGGTGGGTCTTTGCOACCAC GACTAATTCGTAACCXrCTAO 



Figure 4 Creation of a cyclized beta-lactamase gene with a GSGGG linker. 
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■Q 



pi||Pl Maltose Binding Protein 



pDIMC8-MBP(164-165) 



R/laltose Binding Protein Amino Add 

-CltiArtGCTtoclGAC^GAAGAGCGCTCOAGGcScl^^ 
-GACTAAC(»CGACTGCCCjV£12CIS:ffi:GAGCTCCGAGAAGIGCTGbcCCCAATAC( 

Sapl ^ 



digest w/ Sap! 

isolate desired fragment 



^TtiATrfGCf GcIgAC' ^ ^ ^ 
— GACTAACGACGACTGCCC 



GACrGGCS'GGrt'ArGC(SrTCr~. 
CCCCCAATACGCAAG- 



Klenow and dNTPs 



— CTdATfcCT<K'laACGGd 
— GACTAACGACGACTGCCC 



164 166 166 

GACteGtfGGTrAfGCCStTCf, 

CTGCCCCCAATACGCAAi 



Figure 5 Prepartion of target DNA for insertion of DNA at a specific location (in 
case between MBP [1-165] and MBP [164-370]) 
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I a) Gene Transalption j 



.Hi 



I b) Signal 1iranscluctiorr| 

m 



i 



[ Apoptosls. Differentiation, etc] 
I c) Targeted Drug Delivery | 




surface protein unique or 
present at higher level on 
diseased cell or pathogen 



<^ dnig 



^ drug 



all molecule, protein or 
er signal (e.g. pH 
I) Inside (xll 



protein with domain that Is 
taken up by cells (e.g, 
transferrin) 



I e) CondWonally-ecttva taric protatiw] 

A 

healthy cell diseased cell or pathogen 



Inactive form of torfn 
active form of toxin 



I f) Metabolic Englneoring | 



Figure 6 Application of Molecular Switches, (a) Regulation of gene 
transcription. A DNA binding protein is functionally coupled to a protein that binds □ 
such that the protein releases from the DNA in the presence of □ . The signaling 
molecule □ , which could be a naturally occurring in the cell or supplied as a drug, could 
then be used to induce expression of an exogenously supphed therapeutic or toxic protein 
or to repress/induce expression of endogenous genes in order to restore normal cell 
function, (b) Modulation of cell signaling pathways. Cell fates can be regulated by 
engineering a protein involved in a cell signaling pathways to be conditionally active in 
the presence of the signal □ , which could be a naturally occurring small molecule, a 
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protein or an administered drug, (c) Targeted drug delivery. A drug binding protein can 
be functionally coupled to a protein that binds some small molecule or protein that is a 
signature of a disease (e.g. a surface protein). The drug binding protein releases the drug 
only in the presence of the molecular signature of the disease, (d) Drug transport. A 
drug binding protein can be fused to a protein that is taken up by cells in order to 
transport the drug into cells. The two proteins are functionally coupled such that the 
conformational change in the transport protein upon interaction with □ inside the cell is 
coupled to the release of the drug, (e) Conditionally active toxic proteins. A toxic 
protein is engineered such that it is nontoxic unless it interacts with □ , which is a small 
molecule or protein signature of a diseased cell or pathogen, (f) Metabolic engineering. 
Just as small molecules in cells can allosterically modulate metabolic pathways, 
engineered molecular switches can control metabolic pathways. This could be achieved 
by functional coupling of an enzyme (whose activity is to be modulated) with a protein 
that binds □ . (g) Biosensors. The key design issue in biosensors is the creation of a 
macromolecule that specifically binds its target ligand and, upon binding, transduces this 
into a signal (such as optical or electrochemical changes) that can be detected 
macroscopically. The functional coupling of a binding protein and a signal transducing 
properties is a general strategy towards engineering new families of biosensors. 
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A hallmark of biological systems is the high degree of interactions amongst and within 
their constituent components. One advantage that such interactions bring is the 
establishment of coupling between different functions. A protein that couples two 
functions can be described as a molecular switch. For example, an allosteric enzyme is a 
switch that couples effector levels (input) to enzyme activity (output). In most general 
terms, a molecular switch couples signals (e.g. ligand binding, protein-protein 
interactions, pH, covalent modification, temperature) to functionality (e.g. enzymatic 
activity, binding affinity, fluorescence). Molecular switches can be of an "on/off nature 
or such that the signal modulates the function between two different levels of activity 
The network of such molecular switches establishes the complex circuits that control 
cellular processes. Protein molecular switches have a wide variety of potential 
applications including the regulation of gene transcription, the modulation of cell 
signaling pathways, targeted drug delivery, drug transport, the creation of conditionally 
active toxic proteins, metabolic engineering, and the creation of novel biosensors. We 
have devised a combinatorial protein engineering algorithm that can create molecular 
switches by the coupling of two proteins' functionalities. We have demonstrated the 
efficacy of the algorithm by creating a set of allosteric enzymes fi-om two unrelated 
proteins with the prerequisite effector-binding and catalytic functionalities, respectively 
In some of these engineered molecular SAvitches, the presence of effector increased 
catalytic activity by more than ten-fold. 
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Described are methods for creating and using molecular switches with different affinities for 
the effector molecule (signal) as well as methods for creating and using molecular switches with 
different specificities for the effector molecule (i.e. it responds to a new signal). 

We have demonstrated a method for creating a molecular switch by a combinatorial 
^proadi of recombining two genes so as to couple tiieir functions. We have demonstrated the 
approach by recombining the gaies for TEMl p-lactamase (BLA) and the E. coli maltose binding 
protein (MBP) such that BLA catalytic activity was a fimction of tfie concentration of maltose. 
The kinetic parameters and affinity for maltose of one of Aese switches (RG13 -referred to in the 
previous provisional as RG-200-13 - which has the amino acid sequence MBP[1-316]-BLA[227- 
286-GSGGG-BLA[24-226-S-MBP[3 19-370]) have beat determined and are shown in Tables 1 and 
2. RG13 has a dissociation c(Histant for maltose is about 5-^ |iM in tiie absence of a BLA substrate. 
In the presence of saturating amounts of the substrate carbaiidllin, the dissociation constant for 
maltose decreases to about 1 yM. This shows that tfie binding of maltose and substrate 
(carbenicillin) are coupled. The coupling energy is on the order of 1 kcal/mol. This is OMisistait 
with a deaease in Km for niti-ocefin in tiie presmce of maltose (Table 2) 

A switch is most useful if the range of the concentration of the signal (maltose, in the case of 
RG13) overlaps witii the range of concentiation of the signal that the dependent function responds 
to. For the case when a ligand-binding protein is used as the signal detector and the ligand is the 
signal, the later range corresponds to approximately to the range 0. l^d - 10 ATa, where is the 
dissociation constant of the switch and the signal. This can be seen from tiie following example: 

In the case of RG 13, the velocity of nitrocefin hydrolysis is the dependent function. The 
velocity (v) of nitrocefin hydrolysis depends on tiie steady state kinetic parameters by the 
Michaelis-Menten (Equation 1). 

where [E]o is the concentration of the switch, [S] is the concentration of nitrocefin and kax and J^m 
are the Michaelis-Menten kinetic parameters. In the absence of maltose, the velocity is found by 
Equation 2 

where the superscript "-" designates that the parameters are those when maltose is not bound to the 
switch. In the presence of saturating concentrations of maltose (i.e. maltose is bound to all 
switches; this occurs at very high concentrations of maltose relative to the dissociation constant 
for maltose), tiie velocity is found by Equation 3 

V.l» (3) 

where the superscript designates tiiat the parameters are those when maltose is bound to the 
switch. At intermediate concentrations of maltose, the velocity depends on the fraction of switches 
that have maltose bound. If we make the approximation that the small cooperative effect of 
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maltose- and substrate-binding can be ignored, the fraction F of switches that are bound to maltose 
can be found by Equation 4. 

[ir,].[Af] 

where [M] is the concentration of maltose. The velocity of nitrocefin hydrolysis is thus found by 
" ' ii5 



Equation 5 is true for all concentrations of maltose as it reduces to Equations 2 and 3 in the limiting 
cases of no maltose bound and saturating maltose, respectively. The fold-increase in the rate of 
nitrocefin velocity Z is found by dividing the right hand side of Equation 5 by the velocity in the 
absence of maltose to get Equation 6 



(6) 



Equation 6 is plotted in Figure 1 for the case of RG13 hydrolysis of 25 iiM nitrocefm using a range 
of different dissociation constants for maltose. One can see that the velocity is dianging most in the 
range of one order of magnitude higher or lower than the dissociation constant for maltose. The 
switdi will have the largest change in the dep^dent function if the concentration of the signal (in 
the case of RG13 this is maltose) changes within this range or changes through this range. Hius, it 
is desirable for the application of molecular switches to create svwtches with different aflSnities for 
the signal so as to be useful for different concentration ranges of the signal. 

In one aspect of the invention, switdies created by the mettiod have differait affinities for 
maltose. For example switch RG-5-169 (sequence MBP[l-338]-BLA[34-286]-GSGGG-BLA[24- 
29]-MBP[337-370] has &Ki for maltose (> 1 mM) that is much greater than RG13 has for maltose 
(1-5 fiM). 

In another aspect of the invoition, the effector-affinity of switches is altered by a variety of 
metfiods including rational design and directed evolution methods. As long as the resulting altered- 
afOnity switch maintains a conformational change upon binding the effector that results in changes 
the dependait function, switching will be maintained. 

In another aspect of the invraition mutations already known to alter the afSnity of the ligand 
recognition domain (for RG13 this is MBP) are introduced into flie switch to create switches widi 
altered affinity for the ligand. These mutations consist of those that make direct contact with tiie 
ligand, those that make contact with residues that make direct contact with the hgand and those that 
are more distal from the binding site pocket For example, Marvin and Hellinga (Marvin and 
Hellinga 2001) have described a set of mutations in the hinge region of MBP that manipulate the 
conformational equilibria between the open and closed state (Marvin and Hellinga 2001). Residual 
dipolar couplings have been used to establish that the ^o forms of these mutants are partially 
closed relative to the apo wildtype MBP with the closure angles being 9.5** and 28.4° for I329W and 
A96W/I329W, respectively (the ligand-bound closed form of MBP has a closure angle of 35°) 
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(Millet, Hudson et al. 2003). Because partial closing shifts the equilibrium towards the ligand- 
bound state, the I329W mutation results in about a 20-fold increase in affinity for maltose and the 
A96W/I329W double mutant results in a 60-fold increase in the affinity for maltose compared to 
wildtype MBP at 25°C (Marvin and Hellinga 2001). The affinities of MBP, MBP(I329\V) and 
MBPa329W/A96W) are 800 nM, 35 nM and 13 nM, respectively. Introduction of these mutations 
into RG13 results in mutants with increased affinity for maltose (Table 1) v\*ile still maintaining 
switdiing behavior (Table 2 and Figure 2). In addition, tiie level of activity in the presence of 
saturating amounts of maltose (the "on" state) was not affected by the mutations (Table 1). From a 
practical standpoint, the increase in maltose affinity of fliese hinge mutants indicates that ligand- 
affinity of RG13 can be systematically changed to create molecular switches that respond to 
differait concoitration ranges of effector yMle still maintaining switching ability and high activity 
in flie presence of the effector. By increasing flie affinity for maltose one increases the sensitivity of 
the switch (i.e. it will switch to a higher level of activity at lower concaitrations of maltose). In a 
further aspect of the invention, combinations of these affinity-altered switches behave as a 
composite switdi with a hig^ dynamic range. 

Tlie invention also consists of methods to alter the specificity of the signal recognition 
domain so that it recognizes other signals. This would allow the constiiiction of new molecular 
switches in which the dependent function responded to new signals without requiring tiie 
construction of aitirely new molecular switches. For die example of RGB, vdiere the signal 
binding domain is die maltose binding protein, fliese metiiods can diange the ligand to which tiie 
switdi binds to. This would allow the construction of molecular switches in which BLA activity 
could be modulated by different ligands witiiout developing entirely new molecular switches. Thus, 
in a further aspect of the invention, the idaitity of tiie signal to whidi flie switch responds can be 
altered by mutations in existing switches. For example, mutations in tiie signal recognition domain 
already known to alter flie ligand-binding specificity can be inti-oduced into the switdi to create 
switdies tiiat respond to new ligands. For example, Hellinga and colleagues have computationally 
designed periplasmic binding proteins witii radically different binding specificities (Looger, Dwyer 
et al. 2003) including designing MBP to bind Zn^"^ (Marvin and Hellinga 2001) instead of rnaltose. 
MBP binds maltose witii high affinity (lU = 0.8 \iM) but does not bind Zn^^. MBP witfi tiie A* set 
of mutations (A63H/R66H/Y155EAV340E) has high affmity for Zn^^ {Ki = 5.1 \iM) and does not 
bind maltose (Marvin and Hellinga 2001). Introduction of tiie A* set of mutations into RG13 
results in a switch tiiat responds to Zn^* but not maltose. 

In a fiirtiier aspect of die invention, die signal recognition domain is altered by rational 
design or directed evolution to bind to new effectors. Wifli respect to testing mutations predicted by 
rationale design or screening or selecting libraries created for a directed evolution approach, 
existing switches are used to efficiently test or select for binding to new ligands in vivo. For 
example, E. colt cells expressing die MBP-BLA switch RG13 from the lac promoter on pDIMCS 
have a higher MIC for ampicillin (Amp) in tiie presence of maltose tiian in tiieir absence (Table 3) 
because tiie BLA enzymatic activity of RG13 (hydrolysis of ampicillin) is higher in tiie presence of 
maltose. Thus, for example, mutations created in RG13 (eitiier by rational design or by a stochastic 
or semi-stochastic metiiod) such tiiat mutant forms of RG13 bind anotiier ligand X (and behave as a 
switch) can be screened or selected for in vivo. E. coli producing such a new switch will grow at 
200 ng/ml Amp in ttie presence of X but not in the absence of X. 

In a fiirther aspect of the invention, ligands tiiat bind to tiie signal recognition domain in a 
different manner have different switching ability. This is demonstrated by tiie fact fliat 
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P-cyclodextrin, which is known to bind to MBP but with a different confonnational change 
(Skrynnikov, Goto et al. 2000; Hwang, Skrynnikov et al. 2001), changes the activity of the RG13 
switdi in a different manner than maltose (see Table 2). 
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Table 1 Hinge Mutations in MBP domain Increase Maltose Affinity in RG13. 



Protein 


Ligand 


Ad maltose QiM)' 


No 
Substrate'' 


25|tM 
nitrocefin" 


Saturating Carbenicillin 


IPF" 


En^g^atic 
assay*" 


RG13 


maltose 


5.5 ±0.5 


6.7 ± 0.03 


1.3 ±0.5 


0.9 ±0.1 


RG13 1329W 


maltose 


0.55 ±0.13 


1.0 ± 0.04 


nd 


0.11 ±0.01 


RG13 I329W/A96W 

"PfMlHifirtTic inn m\4^1^ 


maltose 
or"! <f\ 


nd 


0.17 ±0.02 


nd 


nd 



''Determined by measuring intrinsic protein fluorescence (IPF) as a function of maltose 
concentration. When using IPF at saturating carbaiicillin, a concentration of 10 mM 
carbenicillin was used. 

'Determined by measuring the initial rate of nitrocefin tydrolysis as a function of maltose 
concentration. 25 \tM nitrocefin is well below the of nitrocefin. Thus, most molecules of 
RG13 will not have nitrocefin bound and the effective that is measured is close to what it 
would be in the absence of substrate. 

Determined by measuring the initial rate of carbenicillin hydrolysis as a function of maltose 
concentration. A concentration of 1.5 mM carbenicillin was use4 which is well above the J^m 
of carbenidUin. Thus, most molecules of RG13 will have carbenicillin bound and the Ad that is 
measured is in the presaice of bound substrate (carbenicillin). 



Table 2 Kinetic parameters of nitrocefin hydrolysis'. 



Protein 


Effector 


kcat (S'')" 


Ratio" 


A„ (nM)" 


A„ 
Ratio" 


^ca. /A„" 
(s * |iM-') 


Ratio" 


RG13 




200 ± 40 




550 ± 120 




0.37 ±0.10 




RG13 1329W 




190 ± 30 




350 ±60 




0.54 ±0.11 




RG13 

I329W/A96W 




360 ± 40 




260 ±40 




1.4 ±0.3 




RG13 


maltose 


620 ± 30 


3.1 ±0.6 


68±4 


0.12 ±0.03 


9.2 ±0.7 


25 ±7 


RG13 I329W 


maltose 


590 ± 50 


3.1 ±0.5 


53 ±7 


0.15 ±0.03 


11.0 ±1.8 


20±5 


RG13 

D29W/A96W 


maltose 


530 ± 20 


1.5 ±0.2 


60 ±4 


0.23 ± 0.04 


8.9 ±0.8 


6.4 ± 1.3 


RG13 


p-cyclo' 


590 ± 60 


2.9 ±0.6 


870 ±90 


1.6 ±0.4 


0.67 ±0.10 


1.8 ±0.6 



Conditions: 100 mM sodium phosphate bufifer, pH 7.0, 25*'C; concentration of effector is 5 mM 
(with efifector)/(wilhout effector). 
"P-cyclodextrin 



^1 
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Ta ble 3 Minimum inhibitory concentration of ampidllin forE. co li cells expressing RG13' 



Supplement to plate 


MIC ampicillin (ixg/ml) 


none 


100 


50 [iM maltose 


400 


5 wM maltose 


400 



'conditions: LB plates, 37 **C, siq)plemented with maltose as indicated. Approximately 1000 colony 
forming units (without ampicillin) per plate. Concmtrations of ampicillin tested: 0, 25, 50, 100, 200, 
400 and 800 \ig/ml. 
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maltose concentration (jxM) 

Figure 1 Fold increase in vdocity of switch RG13 witli different dissociation concentrations {Ka) for 
maltose. Concentration of nitrocefin was 25 \kM. The kinetic parameters of RG13 with and without 
maltose are those in Table 2. Equation 6 was used to generate the curves. 



'Molecular S witdies and Nfethods for Making and Using the Same" Maic Ostermeier 



n. o 
£ «) 

i ^ 



3000 



2500 



2000 



1500 - 



1000 



500 




RG13 R(^13 R(^13 

I329W r329WM96W 



Figure 2. Percent improvement in kinetic parameters of nitroceiin hydrolysis of switches in presence 
of maltose. Data taken from Table 1. For k^t and k^JK^, an improvement is an increase in flie value. 
For .^m, an improvement is a decrease in the value. 



no 
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Summary 

We have created a molecular switch by the in vitro recombination of non- 
homologous genes and subjecting the recombined genes to evolutionary pressure. The 
gene encoding TEMl |3-lactamase was circularly permuted in a random fashion and 
subsequently randomly inserted into the gene encoding Escherichia coli maltose binding 
protein. From this library a switch (RG13) was identified in which its p-lactam 
hydrolysis activity was compromised in the absence of maltose but increased 25-fold in 
the presence of maltose. Upon removal of maltose, RG13's catalytic activity returned to 
its pre-maltose level, illustrating that the switching is reversible. The modularity of 
RGI3 was demonstrated by increasing maltose affinity while preserving switching 
activity. RG13 gave rise to a novel cellular phenotype, illustrating the potential of 
molecular switches to rewire the cellular circuitry. 

Running Title: Creation of a molecular switch by recombination 
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Introduction 

The incredible complexity of biological systems derives to a large extent from the 
high degree of interactions amongst their constituent components. As such, the cell is 
often described as a complex circuit consisting of an interacting network of molecules. A 
key component of these networks are protein "switches" that serve to couple cellular 
functions. A switch recognizes an input signal (e.g. ligand concentration, pH, covalent 
modification) and, as a result, its output signal (e.g. enzyme activity, ligand affinity, 
oUgomeric state) is modified. Examples of natural switches include allosteric enzymes 
which couple effector levels to enzymatic activity and ligand-dependent transcription 
factors that couple Ugand concentration to gene expression. The ability to create novel 
switches or to modify existing switches by coupling hitherto imcoupled functions would 
enable the rewiring of the cellular circuitry to om- own design. In addition, the ability to 
create protein switches has tremendous practical potential for developing novel molecular 
sensors, "smart" materials, and as a tool for elucidating molecular and cellular function. 

We sought a general approach for creating switches in which one could select 
from natural or engineered proteins with the desired input and output functions and, by 
combining the proteins in a systematic fashion, create switches in which their functions 
were tightiy coupled. In addition, we sought switches in which the presence of the signal 
resulted in an increase in the output fimction. Existing approaches [1-16] either require 
existing switches as starting points, are limited to specific input functions and 
mechanisms, lack reversibility, or lack practicality in vitro or in vivo. In addition, most 
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existing approaches have not been shown to produce switches with large changes in 
output (>1 0-fold) in response to a signal. 

Our approach involves recombination between two genes encoding the 
prerequisite input and output functions. Such an approach is inspired by the evolutionary 
mechanism of domain recombination [17] - a major facilitator in the natural evolution of 
protein function [18]. Domain recombination strategies have been apphed to the 
construction of switches [2, 7, 10-16], but the very limited manner in which the domains 
were recombined has restricted the success of this approach. We reasoned that a more 
diverse exploration of fusion geometries between two proteins would enable the creation 
of switches with superior properties. The structural space that we sought to explore can 
be conceptualized as "rolling" the two proteins across each others surface and fusing 
them through peptide bonds at the points where their surfaces meet. We developed a 
homology-independent, combinatorial method for recombining genes that samples such a 
structural space. The method involves two basic steps. First, the gene to be inserted is 
circularly permuted in a random fashion to vary the fusion site on the insert gene. 
Second, the library of the circularly permuted gene is randomly inserted into the target 
gene. 

Results and Discussion 

We applied this approach to the recombination of the genes encoding TEM-1 P- 
lactamase (BLA) and the E. coli maltose binding protein (MBP). BLA and MBP lack 
any sequence, structural or functional relationship except for the fact that they are 
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periplasmic proteins of bacterial origin. BLA is a monomeric enzyme that hydrolyzes the 
amide bond of the P-lactam ring of P-lactam antibiotics. The presence of maltose has no 
effect on wild type BLA enzymatic activity, with or without the presence of an equimolar 
amovint of MBP [15]. MBP is a member of the periplasmic binding protein superfamily 
and is involved in chemotactic response and the transport of maltodextrins. MBP 
consists of a single polypeptide chain that folds into two domains connected by a hinge 
region. The single binding site for maltose is at the interface of these two domains. In the 
absence of maltose, MBP exists in an open form. Maltose-binding is concomitant with a 
35° bending motion about the hinge resulting in the closed form of the protein [19]. We 
sought to create a molecular switch by combining BLA and MBP in such a manner that 
the rate of P-lactam hydrolysis was coupled to maltose-binding and maltose 
concentration. We reasoned that in such a switch the conformational change in the MBP 
domain upon maltose binding would propagate to the active site of the BLA domain and 
alter its catalytic properties, a mechanism analogous to natural allosteric effects. 

The ftagment of the BLA gene coding for the mature protein was circularly 
permuted in a random fashion [20, 21] and subsequently randomly inserted [15] into a 
plasmid containing the E. coli malE gene that codes for MBP (Figure 1 A). For the 
random circular permutation of bla[24-286], we fused the 5' and 3' ends by an 
oHgonucleotide sequence that would result in a GSGGG flexible peptide linker between 
the original N- and C- termini of the protein. This linker was designed to be of sufficient 
length to connect the termini without perturbing BLA structure. Statistical analysis on 
the resulting library indicated that a minimum of 27,000 members contained a circularly 
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permuted bla[24-286] inserted into malE in the correct orientation with both fusion 
points in-frame with malE. Approximately 0.33% of these members were able to form 
colonies on rich media plates containing 200 ng/ml ampicillin and 50 mM maltose. 
These library members were screened in 96-well format for a maltose dependence on p- 
lactamase activity using a colorimetric assay for nitrocefin hydrolysis. We identified one 
protein (RG13; Figure IB) in which the initial velocity of nitrocefin hydrolysis (at 50 
nitrocefin) increased by 17-fold in the presence of maltose. In RG13, the BLA was 
circularly permuted in a loop that precedes a P-sheet that lines the active site of the 
enzyme. The circular permuted BLA was inserted at the beginning of an a-helix of MBP 
such that two MBP residues were deleted (Figure IC). 

Using purified RG13, we confirmed that the increase in catalytic activity occurred 
only in the presence of sugars that are known to bind and induce a conformational change 
in MBP (Figure 2A). Sugars known to induce a large conformational change [22] 
(maltose and maltotriose; 35° closure angle) produced a 15- to 20-fold increase in the rate 
of nitrocefin hydrolysis. P-cyclodextrin, which only induces a 10° hinge bending motion 
in MBP [24], increased the rate 2-fold. Non-ligands such as sucrose, lactose and 
galactose had no effect. We next determined that the switching was reversible (i.e. upon 
removing maltose, the activity returns to its pre-maltose level). This was demonstrated 
both by competing maltose off RG13 using P-cyclodextrin (Figure 2B) and by subjecting 
RG13 to repeated rounds of dialysis and addition of maltose to cycle between low and 
high levels of enzymatic activity (Figure 2C). This reversibility is one of the features that 
differentiates our approach from methods such as conditional protein spUcing [8, 16] that 
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produce non-reversible switches that control the production of active protein rather than a 
protein's activity per se. 

From steady state kinetics experiments, we determined RG13's Michaelis-Menten 
parameters for nitrocefin hydrolysis at 25°C in the absence and presence of maltose. In 
the absence of maltose, the catalytic constants were /fccat = 200 ± 40 s"' and K„ = 550 ± 
120 (iM. With the addition of saturating amounts of maltose, A^at increased 3-fold and 
decreased 8-fold, resulting in a 25-fold increase in kcJKm. The kinetic constants of 
RG13 in the presence of saturating concentrations of maltose (^cat = 620 ± 60 s'' and 
= 68 ± 4 nM) were comparable to that previously reported for BLA at 24°C (yfccat = 900 s'^ 
and ^„ = 1 10 nM; [25]) indicating that RG13 is a very active TEMl p-lactamase in the 
presence of maltose. RG13 has exhibited switching behavior with all seven BLA 
substrate tested to date including ampicillin (16-fold rate increase at 50 \xM. ampicillin) 
and carbenicillin (12-fold rate increase at 50 |aM carbenicillin). 

The increase in indicates that maltose binding affects the catalytic steps. 
However, since Km is a combination of the rate constants for substrate binding as well as 
catalysis [26], Km could not be directly used to ascertain the effect of maltose on substrate 
binding. Instead, the effect of maltose on substrate binding was determined indirectly by 
measuring the effect of substrate on maltose binding using intrinsic protein fluorescence. 
These studies suggested that RG13 undergoes a conformational change much like MBP 
does upon maltodextrin-binding since maltose-induced quenching of total fluorescence 
(-10%) and shifting of the maximum fluorescence wavelength (a 1 .5 nm red-shift for 
maltose and a 4 nm blue-shift for p-cyclodextrin) were similar to that previously reported 
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for MBP [27]. The presence of saturating amounts of the substrate carbenicillin decreased 
the dissociation constant of maltose and RG13 from 5.5 ± 0.5 )aM to 1.3 ± 0.5 ^iM; thus, 
maltose binding must decrease the dissociation constant of carbenicillin and RG13 by the 
same factor (Supplementary Figure SI). This corresponds to a coupling energy of 
approximately 1 kcal/mol and offer an additional explanation for the increase in P-lactam 
hydrolysis in the presence of maltose: a positive heterotropic allosteric effect on substrate 
binding. 

Presumably, the BLA domain of the apo, open form of RG13 exists in a 
compromised, less active conformation. In the hgand-bound state, the BLA domain 
exists in a more normal, active conformation. But what is the state of the BLA domain in 
the process of closing? At what closure angle do the catalytic properties of RGB 
improve? To address these questions, we took advantage of mutations in the hinge 
region of MBP that manipulate the conformational equilibria between the open and 
closed state [28]. Residual dipolar couplings have been used to establish that the apo 
forms of these mutants are partially closed relative to the apo wildtype MBP with the 
ensemble average closure angles being 9.5° and 28.4° for I329W and I329W/A96W, 
respectively [29]. The ligand-bound closed forms of MBP, MBP(I329W) and 
MBP(I329W/A96W) have closure angles of 35°. Partial closing shifts the equilibrium 
towards the ligand-boimd state and thus the mutations increase the affinity for maltose 



Introduction of these mutations into RG13 resulted in the creation of more 
sensitive switches-switches that respond to lower concentrations of maltose (Figures 3 A 



[28]. 
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and 3B). The fact that we observed qualitatively similar changes in maltose-affinity when 
the mutations are introduced into RG13 strongly suggests that the relative order and 
magnitude of the angles of closure of RG13, RG13(I329W) and RG13(I329W/A96W) 
are similar to that of MBP, MBP(D29W) and MBP(D29W/A96W). Thus, the apo forms 
of the two RG13 mutants offer conformations intermediate between the open to the 
closed form of RG13-conformations that may reflect that of RG13 in the process of 
closing. Assuming that the process of closing in RG13 passes through the conformations 
of the apo forms of the two RG13 mutants, kinetic characterization of RG13(I329W) and 
RG13(I329W/A96W) suggested that the initial stages of closing do not result in changes 
in the BLA domain that substantially affect catalysis (Figures 3B-D), Both kat and Km 
improved during the intermediate stages of closing but the majority of the effect on 
occurred during the final stages of closing. 

As the magnitude of the allosteric effect was on the same order as that of many 
natural allosteric enzymes, we next examined the biological effects of RG13. The 
switching activity was sufficient to result in an observable phenotype: maltose-dependent 
resistance to ampicillin (Table 1). E. coli cells expressing RG13 had a minimum 
inhibitory concentration (MIC) for ampicillin that was four-fold higher in the presence of 
50 nM maltose. In contrast, the addition of the same concentration of sucrose or glucose 
to a plate did not affect tiie MIC: Thus, RG13 serves to couple the previously unrelated 
functions of ampicillin resistance and maltose concentration. E. coli cells expressing 
RG13 fimction as a growth/no growth sensor for maltose. 
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Significance 

We have shown that two unrelated proteins can be systematically recombined in order to 
link their respective fimctions and create molecular switches. A combination of random 
circular permutation and random domain insertion enabled the creation of a MBP-BLA 
fusion geometry in which conformational changes induced upon maltose binding could 
propagate to the active site of BLA and increase BLA enzymatic activity up to 25-fold. 
The functional coupling of two proteins with no evolutionary or fimctional relationship is 
a powerful strategy for engineering novel molecular function. For example, coupling a 
ligand-binding protein and a protein with good signal transduction properties would 
result in the creation of a molecular sensor for the ligand. Furthermore, switches that 
establish connections between cellular components with no previous relationship can 
result in novel cellular circuitry and phenotypes. We envision, for example, that such 
switches might establish connections between molecular signatures of disease (e.g. 
abnormal concentrations of proteins, metabolites, signaling or other molecules) and 
functions that serve to treat the disease (e.g. delivery of drugs, modulation of signaling 
pathways or modulation of gene expression) and therefore possess selective therapeutic 
properties. 

Supplemental Data 

Supplemental data including experimental procedures and a figure are available at. . . 
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Table 1. Ampicillin resistance ofE. coli cells in the presence and absence of maltose. 





Minimum Inhibitory Concentration of 


Expressed 


Ampicillin (ng/ml)* 


Protein 


No maltose 


50 \xM maltose 


none 


4 


4 


RG13 


128 


512 


BLA(W208G)t 


32 


32 


BLA 


^000 


^000 



*Conditions: DH5a-E cells on LB plates (with or without maltose) incubated at 37°C for 20 
hours. 

fA mutant of BLA with reduced activity. 
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Figure Legends 

Figure 1. Creation of MBP-BLA molecular switches by non-homologous recombination, 

(A) The fragment of the BLA gene coding for the mature protein (codons 24-286) is flanked 
by sequences coding for a GSGGG linker (each of which contains a BamHl site). The 
fragment is excised by digestion with BamHl and cycUzed by ligation under dilute DNA 
concenfrations. A single, randomly-located double strand break is introduced by DNasel 
digestion to create the circularly permuted library. This library is randomly inserted into 
plasmid pDIMC8-MBP containing the MBP gene (malE) under control of the tac promoter 
(tacP/O). The site for insertion in pDIMC8-MBP is created by introduction of a randomly 
located double-stranded break by digestion with dilute concentrations of DNasel. 

(B) Sequence of switch. Regions derived from MBP are shown in blue and those from BLA 
are shown in red. The GSGGG linker and a serine derived from the fusion of partial codons 
are shown in black. The number in parenthesis indicates the amino acid number of the 
starting proteins. The mmibering system for MBP does not include the signal sequence. The 
numbering system for BLA does include the signal sequence and does not follow the 
consensus numbering system for P-lactamases. 

(C) Structures of maltose-bound MBP [22] and BLA bound to an active-site inhibitor [23] 
oriented such that the fusion sites in RG13 are proximal. Dark blue, MBP[1-316]; yellow, 
BLA[227-286]; white, GSGGG Unker; red, BLA[24-226]; green, MBP[3 19-370]; light blue, 
MBP deleted residues 317 and 318. 

Figure 2. Switch activity in RG13 is specific to ligands of MBP and is reversible. 
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(A) The percent increase in tiie initial velocity of nitrocefin hydrolysis at 20 ^iM nitrocefin 
upon addition 5 mM of the indicated ligands (maltose, maltotriose and P-cyclodextrin) and 
non-ligands (sucrose, lactose and galactose). 

(B) Reversible switching using competing ligand. During the enzymatic hydrolysis of 
nitrocefin, formation of product is monitored by absorbance at 486 nm. At time zero the 
reaction is started in 2 ml phosphate buffer (0.1 M) with 20 )aM nitrocefin and 2.5 nM RG13. 
At the time indicated by the first arrow, 20 p,l of 1 M maltose was added resulting in a 10- 
fold increase in the reaction rate. This maltose concentration is above the for maltose but 
is subsaturating. At the time indicated by the second arrow, 230 f^l of 10 mM p-cyclodextrin 
was added (final concentrations are 1.0 mM P-cyclodextrin and 8.9 \jM maltose). Because 
RG13 has similar affinities for maltose and p-cyclodextrin but P-cyclodextrin is present at a 
>100-fold higjier concentration, the P-cyclodextrin preferentially replaces the maltose bound 
to RG13 and the rate of reaction decreases to a level consistent with P-cyclodextrin's modest 
effect on nitrocefin hydrolysis. 

(C) Reversible switching after dialysis. The initial rate of nitrocefin hydrolysis at 25 joM 
nitrocefin was measured at the indicated steps. Maltose was added to a final concentration of 
5mM. 

Figure 3. Characterization of switches. 

(A) Dissociation constants for maltose were determined in the absence (white bars) and 
presence (black bars) of saturating concentrations of carbenicillin. The apparent dissociation 
constants in the presence of subsaturating concentrations (25 |aM) of nitrocefin (grey bars) 
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were also deteraiined. The dissociation constants for maltose of MBP, MBP(B29W), 
MBP(I329W/A96W) (dashed line) reported by Marvin and Hellinga [28] are shown for 
comparison. 

(B) - (D) Steady state kinetic parameters of nitrocefin hydrolysis for RG13, 
RG13(I329W) and RG13(I329W/A96W) in the presence (black bars) or absence (white 
bars) ofsaturating concentrations of maltose. Experimental conditions: 100 mM sodium 
phosphate buffer, pH 7.0, 25°C. 
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Figure 2 
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Supporting Data for Guntas. Mitchell, and Ostermeier. 

Experimental Procedures 
Supplemental Figure SI 

Experimental Procedures 
Materials 

All restriction enzymes, T4 DNA ligase, T4 DNA polymerase, and calf intestinal 
phosphatase were purchased from New England Biolabs (Beverly, MA). pGEM T-vector 
cloning kit and Taq polymerase were purchased from Promega (Madison, WI). DNAsel 
was purchased from Roche Biochemicals (IndianapoUs, IN). Qiaquick PGR purification 
kit and Qiaquick gel extraction kit were purchased from Qiagen (Valencia, CA). 
Popculture reagent, rLysozyme, benzonase nuclease, and His-tag protein purification kit 
were purchased from Novagen (Madison, WI). Oligonucleotides and Electromax DH5a- 
E electrocompetent cells were purchased from Invifrogen (Carlsbad, CA). Nitrocefin was 
purchased from Oxoid (Hampshire, UK). Maltotriose and P-cyclodextrin were purchased 
from Sigma (St. Louis, MO). Antibiotics, maltose, lactose, galactose and sucrose were 
purchased from Fisher Scientific (Pitsburgh, PA). 

Random circular permutation. 

The portion of the bla gene encoding the mature BLA was fused to a sequence coding 
for a GSGGG linker and containing a BamHl site by PGR ampUfication using the 
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forward primer S'-TTTGCC GGATCCGGCGGTGGC CACCCAGAAACGCTGGTG-^' 
and the reverse primer 5'- GTCTGAGGATCCCCAATGCTTAATCAGTGA-3'. 
Portions of the primers encoding the GSGGG linker are underlined and the BamHl site is 
highlighted in bold. The PGR product was desalted using Qiaquick PGR purification kit 
and Hgated to the pGEM T-vector to create plasmid pGEMT-BLA. One hundred and fifty 
Hg of pGEMT-BLA was digested with 1000 units of BamHl and the DNA fi-agment that 
encodes BLA was gel purified using Qiaquick gel purification kit. Eighteen ^g of this 
DNA was cyclized by Ugation at 16°C for 18 hours in a reaction volmne of 5.1 ml in the 
presence of ligase buffer (50 mM Tris-HCl, 10 mM MgCb, 10 mM DTT, 1 mM ATP, 25 
(ig/ml BSA pH 7.5) and 600 Weiss units of T4 DNA Ugase. After heat inactivation of the 
Ugase, the concentrated reaction mixture was desalted and the circular DNA was purified 
by agarose gel electrophoresis using Qiaquick Gel Extraction kit. To introduce the 
random double stranded break, 8 ^ig of circular DNA was digested with 8 milliunits of 
DNAse I in the presence of 50 mM Tris-HCl, pH 7.4, 10 mM MnCh and 50 ng/ml BSA 
in a total volume of 0.8 ml for 8 minutes. The reaction was quenched by the addition of 
EDTA to a concentration of 5 mM and the solution was desalted using a Qiaquick PGR 
purification column. Nicks and gaps were repaired by incubating at 12°C for 30 minutes 
in a total volume of 90 }xl in the presence of T4 DNA polymerase (6 units) and T4 DNA 
ligase (12 Weiss units) in the presence of 50 mM Tris-HCl, pH 7.5, 10 mM MgCl2, 10 
mM DTT, 1 mM ATP, 25 |xg/ml BSA and 125 pM dNTPs. The DNA was desalted as 
before and the linear DNA (corresponding to the randomly circularly permuted bid) was 
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isolated from circular forms by agarose gel electrophoresis using the Qiaquick gel 
purification kit. 

Random Domain Insertion. 

Plasmid pDIM-C8MalE has the malE gene encoding MBP under the IPTG inducible tac 
promoter. Introduction of a random double stranded breaks (one per molecule of pDIM- 
CSMalE) was performed as described [1]. One hundred ng of randomly linearized 
plasmid pDEMCS-MalE was ligated to 85 ng of randomly circularly permuted BLA 
fragment (5:1 insert/vector molar ratio) in a reaction volume of 15 |j,l. The hgation was 
carried out at 22°C overnight in the presence of ligase buffer and 45 Weiss units T4 DNA 
ligase. After ethanol precipitation, the ligated DNA was transformed into Electromax 
DH5a-E electrocompetent cells by performing ten electroporations of 40 jal cells each. 
Cells were plated on two 245x245 mm LB agar plates supplemented with 50 [ig/ml 
chloramphenicol and incubated at 37°C overnight. The naive library was recovered from 
the large plates and stored in frozen aUquots as described [2]. 

Library Selection and Screening 

The naive library was plated on LB agar plates supplemented with 200 ^ig/ml ampicillin 
and 50 mM maltose and incubated at 3TC overnight. From these plates, 1056 colonies 
were picked to inoculate 1 ml LB media (supplemented with 50 ^ig/ml chloramphenicol 
and 1 mM IPTG) in 96-well format. After incubation overnight at 37°C, each culture was 
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lysed using 0.1 ml Popculture reagent, 40 units of rLysozyme, and 2.5 units of benzonase 
nuclease. Lysates were centrifuged to pellet the insoluble material and the soluble 
fractions were assayed in 96-well format for nitrocefm hydrolysis in the presence or 
absence of 5 mM maltose using a colorimetric assay for nitrocefin hydrolysis [3]. The 
assays were carried out at room temperature using the Spectramax-384 Plus microplate 
reader (Molecular Devices) in the presence of 100 mM sodium phosphate buffer and 50 
[iM nitrocefin in a 200 ^il reaction volume. Clones whose lysates exhibited a greater than 
2-fold increase in the rate of nitrocefin hydrolysis were recultured and their lysates 
assayed again to verify the effect. 

Protein Modifications and Mutagenesis 

A GGSGHg sequence was appended to the sequence of RG13 by PGR amplification with 
the appropriate primers. The PGR product was cloned between Ndel and ATioI sites of 
pET24b (Novagen) to create pET24b-RG13. Mutations I329W and A96W were 
introduced into pET24b-RG13 by a combination of overlap extension PGR and 
Quickchange mutageneis. 

Protein Purification 

One liter LB media containing 50 |ig/ml kanamycin was inoculated with 2% overnight 
culture and shaken at 37 °G. The culture was induced with 1 mM IPTG when the ODeoo 
reached 0.5 and incubated at 22 °G for 16 hours. Pelleted cells were resuspended in 20 ml 




Guntas et al. 
26 



binding buffer supplied by the His-tag protein purification kit (Novagen, Madison, WI) 
and lysed by French press. The soluble fi:action was recovered and the protein was 
purified using the protein purification kit. Eluted protein was dialyzed at 4 "C against 
three liters of 100 mM sodium chloride, 50 mM sodium phosphate buffer overnight 
followed by dialysis against one liter of the same buffer with 20% glycerol for four hours. 
Protein was stored in aUquots at -80°C. RG13 and RG13(I329W) were purified as 
described above. To improve the yield of RG13(A96W/B29W), 10 mM maltose was 
added to the culture at induction. RG13(A96W/I329W) was dialyzed more extensively 
after purification and complete removal of maltose was verified by enzymatic assay on 
successive rounds of dialysis in the presence and absence of maltose. The purities of the 
proteins were estimated by coomassie blue staining of SDS-PAGE gels. The purities of 
RG13, RG13(I329W), and RG13(A96W/I329W) were greater than 98%, 95% and 97%, 
respectively. The extinction coefficients of RG13, RG13(I329W), and 
RG13(A96W/B29W) at 280 nm were calculated [4] to be 126,000; 120,500 and 116,100 
Abs M"^ cm'\ respectively. 

Steady State Kinetics 

All kinetic assays were performed at 25°C in the presence of 100 mM sodium phosphate 
buffer, pH 7.0. Ten )il of enzyme stock was added to 1.59 ml buffer (containing the 
saccharide, if desired). After incubation for 30 seconds, 0.4 ml of 5x substrate was added 
and the absorbance at the appropriate wavelength was recorded using the Cary50 UV- 
VIS spectrophotometer. The wavelength monitored was 486 nm, 240 rnn, and 232 nm for 
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mtrocefin, carbenicillin, and ampicillin respectively. From the initial rate of reaction the 
kinetic constants were determined using Eadie-Hofstee plots. In the absence of maltose, 
the time course of the reaction for RG13, RG13(I329W), and RG13(A96W/I329W) 
displayed a sUght lag in the reaction rate that became more pronounced at higher 
substrate concentrations. The rate data was consistent with a small hysteretic effect [5] 
and not substrate inhibition as preincubation of the enzyme with the substrate for one 
minute prevented the lag from occurring upon addition of more substrate. Therefore, the 
steady state parameters for nitrocefin hydrolysis in the absence of maltose were 
determined by measuring the rate at 1-2 minutes (well after the lag) and correcting the 
substrate concentration by subtracting the amount of substrate hydrolyzed. In all cases 
the extent of reaction at the point the rate was measured was less than 25%. In the 
presence of maltose, no lag was observed. 

Maltose Affinity 

Maltose afSnity for RG13 (in presence and absence of 10 mM carbenicillin) and 
RG13(I329W) (in the absence of substrate) was determined using intrinsic protein 
fluorescence measured on a Photon Technology QuantaMaster QM-4 spectrofluorometer. 
Fluorescence spectra were obtained at 25 °C at different concentrations of maltose in 50 
mM sodium phosphate buffer, pH 7.0, containing 100 mM sodium chloride. The protein 
concentration was 50-100 nM. Excitation was at 280 nm. The quenching in fluorescence 
intensity at 341 nm caused by maltose was used in Eadie-Hofstee equivalent plots to 
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AF 



determine using the following equation: AF = AF^ -Kj — ; where AF is the change 

[^J 

in fluorescence intensity, AFmax is the difference in fluorescence between no maltose and 
saturating amounts of maltose and [L] is the maltose concentration. The fluorescence 
quenching of RG13(I329W/A96W) upon addition of maltose was insufl5cient to 
accurately determine a Ki by this method. The dissociation constant for maltose and 
RG13(I329W) in the presence of saturating carbenicillin (2 mM) was determined by 
measuring the initial rate of carbenicillin hydrolysis as a function of maltose 
concentration. The apparent dissociation constant in the presence of subsaturating 
concentrations of nitrocefin (25 \jM) for all three proteins was determined by measuring 
the initial rate of nitrocefin hydrolysis as a function of maltose concentration. 

In Vivo Characterization of Switches. 

Overnight inoculums of DH5a-E cells expressing RG13, BLA or BLA(W208G) were 
diluted into LB media and plated on LB plates, either in the absence or presence of 50 
\iM maltose, in the presence of increasing amoimts of ampicillin. Ampicillin was present 
in the plates at the following concentrations: 0, 2, 4, 8, 16, 32, 64, 128, 256, 512, and 
2000 fig/ml. Cells were plated at approximately 1000 CFU (no antibiotic) per plate. The 
plates were incubated at 37 "C for 20 hours. The minimum inhibitory concentration 
(MIC) was defined as the lowest ampicillin concentration at which no colonies were 
present, or that at which the nvmiber of colonies present was <1% of the number of 
colonies at the next lowest level of ampicillin. 




Guntas et al. 
29 



References 

1 . Guntas, G., and Ostermeier, M. (2004). Creation of an allosteric enzyme by 
domain insertion, J Mol Biol 336, 263-273. 

2. Ostermeier, M., Nixon, A.E., Shim, J.H., and Benkovic, S.J. (1999). 
Combinatorial protein engineering by incremental truncation. 
Proc.Natl.Acad.Sci.USA 96, 3562-3567. 

3. O'Callaghan, C.H., Morris, A., Kirby, S.M., and Shingler, A.H. (1972). Novel 
method for detection of beta-lactamases by using a chromogenic cephalosporin 
substrate. Antimicrob Agents Chemother 1, 283-288. 

4. Gill, S.C., and von Hippel, P.H. (1 989). Calculation of protein extinction 
coefficients from amino acid sequence data. Anal Biochem 182, 319-326. 

5. Neet, K.E., and Ainslie, G.R., Jr. (1980). Hysteretic enzymes. Methods Enzymol 
64, 192-226. 




Guntas et al. 
30 




L + P + S 



LP + S 
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Figure SI. Coupling of ligand and substrate binding. The change in free energy upon 
protein (P) binding ligand (L) and substrate (S) is the same whether ligand or substrate 
binds first. Adding the free energy changes of the two different paths from L+P+S to 
LPS we see that AG^ + AG^ = AG^ + AGf since the total free energy change is path 
independent. By rearranging this equation to AG^ - AGf = AGj - AG^ we see that the 
left hand side represents the effect that the presence of bound substrate has on Ugand 
binding and the right hand side represents the effect that the presence of bound ligand has 
on substrate binding. The effects must be equal. 
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Summary 

We have created a molecular switch by the in vitro re- 
combination of nonhomologous genes and subjecting 
the recombined genes to evolutionary pressure. The 
gene encoding TEM1 ^-lactamase was circularty per- 
muted in a random fashion and subsequently randomly 
inserted into the gene encoding Escherichia coli malt- 
ose binding protein. From this library, a switch (RG13) 
was identified in which its p-lactam hydrolysis activity 
was compromised in the absence of maltose but in- 
creased 25-fold In the presence of maltose. Upon re- 
moval of maltose, RG13's catalytic activity retumed to 
its premaltose level, illustrating that the switching is re- 
versible. The modularity of RG13 was demonstrated by 
Increasing maltose affinity while preserving switching 
activity. RG13 gave rise to a novel cellular phenotype, 
illustrating the potential of molecular switches to re- 
wire the cellular circuitry. 

Introduction 

The incredible complexity of biological systems derives 
to a large extent from the high degree of interactions 
among their constituent components. As such, the ceil 
is often described as a complex circuit consisting of an 
interacting network of molecules. A key component of 
these networks are protein "switches" that serve to cou- 
ple cellular functions. A switch recognizes an input sig- 
nal (e.g., ligand concentration, pH, covalent modifica- 
tion), and, as a result, its output signal (e.g., enzyme 
activity, ligand affinity, oligomeric state) is modified. Ex- 
amples of natural switches include allosteric enzymes that 
couple effector levels to enzymatic activity and ligand- 
dependent transcription factors that couple ligand con- 
centration to gene expression. The ability to create novel 
switches or to modify existing switches by coupling 
hitherto uncoupled functions would enable the rewiring 
of the cellular circuitry to our own design. In addition, 
the ability to create protein switches has tremendous 
practical potential for developing novel molecular sen- 
sors, "smart" materials, and as a tool for elucidating 
molecular and cellular function. 

We sought a general approach for creating switches 
in which one could select from natural or engineered 
proteins with the desired input and output functions 
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and, by combining the proteins in a systematic fashion, 
create switches in which their functions were tightly 
coupled. In addition, we sought switches in which the 
presence of the signal resulted in an increase in the 
output function. Existing approaches [1-16] either re- 
quire existing switches as starting points, are limited to 
specific input functions and mechanisms, lack revers- 
ibility, or lack practicality in vitro or in vivo. In addition, 
most existing approaches have not been shown to pro- 
duce switches with large changes in output (>1 0-fold) 
in response to a signal. 

Our approach involves recombination between two 
geiies encoding the prerequisite input and output func- 
tions. Such an approach is inspired by the evolutionary 
mechanism of domain recombination [1 7], a majorfacili- 
tator in the natural evolution of protein function [1 8]. 
Domain recombination strategies have been applied to 
the construction of switches [2, 7, 10-16], but the very 
limited manner in which the domains were recombined 
has restricted the success of this approach. We rea- 
soned that a more diverse exploration of fusion geome- 
tries between two proteins would enable the creation of 
switches with superior properties. The structural space 
that we sought to explore can be conceptualized as 
"rolling" the two proteins across each other's surface 
and fusing them through peptide bonds at the points 
where their surfaces meet. We developed a homology- 
independent, combinatorial method for recombining 
genes that samples such a structural space. The method 
involves two basic steps. Rrst, the gene to be inserted is 
circularly permuted in a random fashion to vary the fusion 
site on the insert gene. Second, the library of the circularly 
pemnuted gene is randomly inserted into the target gene. 



Results and Discussion 

We applied this approach to the recombination of the 
genes encoding TEM-1 p-lactamase (BLA) and the 
£ coli maltose binding protein (MBP). BLA and MBP 
lack any sequence, structural, or functional relationship 
except for the fact that they are periplasmic proteins 
of bacterial origin. Bl^ is a monomeric enzyme that 
hydrolyzes the amide bond of the p-lactam ring of 
p-lactam antibiotics. The presence of maltose has no 
effect on wild-type BLA enzymatic activity, with or with- 
out the presence of an equimolar amount of MBP [15]. 
MBP is a member of the periplasmic binding protein 
superfamily and is involved in chemotactic response 
and the transport of maltodextrins. MBP consists of a 
single polypeptide chain that folds into two domains 
connected by a hinge region. The single binding site for 
maltose is at the interface of these two domains. In 
the absence of maltose, MBP exists in an open form. 
Maltose binding is concomitant with a 35° bending mo- 
tion about the hinge, resulting in the closed form of the 
protein [1 9]. We sought to create a molecular switch by 
combining BI_A and MBP in such a manner that the rate 
of p-lactam hydrolysis was coupled to maltose binding 
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Figure 1 . Creation of MBP-BLA Molecular Switches by Nonhomolo- 
gous Recombination 

(A) The fragment of the BU\ gene coding for the mature protein 
(codons 24-286) is flanked by sequences coding for a GSGGG linker 
(each of which contains a BamHI site). The fragment is excised by 
digestion with BamHI and cyclized by ligation under dilute DNA 
concentrations. A single, randomly located double-strand break is 
Introduced by DNasel digestion to create the circularly pennuted 
library. This library is randomly inserted into plasmid pDIMCS-MBP 
containing the MBP gene (ma/E) under control of the ta 
(tacP/0). The site for Insertion in pDIMCS-MBP is ci 
duction of a randomly located double-strai 
with dilute concentrations of DNasel. 

(B) Sequence of switch. Regions derived from MBP ai 
blue, and those from BLA are shown in red. The GSGGG linker and 
a serine derived from the fusion of partial codons are shown in 
black. The number in parentheses Indicates the amino acid number 
of the starting proteins. The numbering system for MBP does not 
include the signal sequence. The numbering system for BLA does 
include the signal sequence and does not follow the c 

g system for |3-lactamases. 

P [22] and BLA b< 



and maltose concentration. We reasoned that in such 
a switch the conformational change in the MBP domain 
upon maltose binding would propagate to the active 
site of the BLA domain and alter its catalytic properties, 
a mechanism analogous to natural allosteric effects. 

The fragment of the BLA gene coding for the mature 
protein was circulariy permuted in a random fashion 
[20, 21] and subsequently randomly Inserted [15] Into a 
plasmid containing the E. coli malE gene that codes for 
MBP (Figure 1A). For the random circular permutation 
of bla[24-286J, we fused the 5' and 3' ends by an oligonu- 
cleotide sequence that would result in a GSGGG flexible 
peptide linker between the original N and C termini of 
the protein. This linker was designed to be of sufficient 
length to connect the termini without perturbing BLA 
structure. Statistical analysis on the resulting library indi- 
cated that a minimum of 27,000 members contained a 
circularly pennuted bla[24-286] inserted into ma/E In the 
correct orientation with both fusion points in-frame with 
malE. Approximately 0.33% of these members were able 
to fonn colonies on rich media plates containing 200 ^^g/ 
ml ampicillin and 50 mM maltose. These library members 
were screened in 96-well format for a maltose depen- 
dence on p-lactamase activity using a colorimetric 
assay for nitrocefin hydrolysis. We Identified one protein 
(RG1 3; Figure 1 B) in which the initial velocity of nitrocefin 
hydrolysis (at 50 |xM nitrocefin) increased by 17-fold in 
the presence of maltose. In RG1 3, the BLA was circularly 
pennuted in a loop that precedes a p sheet that lines 
the active site of the enzyme. The circular-permuted 
BLA was inserted at the beginning of an a helix of MBP 
such that two MBP residues were deleted (Figure 1C). 

Using purified RG13, we confirmed that the Increase 
in catalytic activity occurred only In the presence of 
sugars that are known to bind and Induce a confonna- 
tionai change in MBP (Figure 2A). Sugars known to 
induce a large conformational change [22] (maltose 
and maitotrlose; 35° closure angle) produced a 15- to 
20-fold Increase In the rate of nitrocefin hydrolysis, 
(3-cyclodextrin, which only induces a 10° hinge bending 
motion in MBP [24], increased the rate 2-fold. Nonll- 
gands such as sucrose, lactose, and galactose had no 
effect. We next determined that the switching was re- 
versible (i.e., upon removing maltose, the activity returns 
to Its premaltose level). This was demonstrated both 
by competing maltose off RQ13 using p-cyclodextrin 
(Figure 2B) and by subjecting RG13 to repeated rounds 
of dialysis and addition of maltose to cycle between low 
and high levels of enzymatic activity (Figure 2C). This 
reversibility is one of the features that differentiates our 
approach from methods such as conditional protein 
splicing [8, 16], which produce nonreversible swntches 
that control the production of active protein rather than 
a protein's activity per se. 

From steady-state kinetics experiments, we deter- 
mined RG13's Michaelis-Menten parameters for ni- 
trocefin hydrolysis at 25°C in the absence and presence 



active-site inhibitor [23] oriented such that the fusion sites in RG13 
are proximal. Dark blue, MBP[1-316]; yellow, BLA[227-286]; white, 
GSGGG linker red, BLA[24-22G]; green, MBP[31 9-370]; light blue, 
MBP deleted residues 317 and 318. 
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Figure 2. Switch Activity In RG1 3 Is Specific to Ugands of MBP and 
Is Reversible 

(A) The percent increase In the Initial velocity of nitrocefin hydrolysis 
at 20 (iM nitrocefin upon addition 5 mM of the Indicated ligands 
(maltose, maltotriose, and p-cyclodextrin) and nonligands (sucrose, 
lactose, and galactose). 

(B) Reversible switching using competing ligand. During the enzy- 
matic hydrolysis of nitrocefin, fonnation of product is monitored by 
absorbance at 486 nm. At time zero, the reaction is started in 2 ml 
phosphate buffer (0.1 M) with 20 [M nitrocefin and 2.5 nM RG13. 
At the time indicated by the first arrow, 20 |il of 1 M maltose was 
added, resulting in a 1 0-fold increase in the reaction rate. This malt- 
ose concentration is above the for maltose but Is subsaturatlng. 
At the time Indicated by the second anrow, 230 |tl of 10 mM 
3-cyclodextrin was added (final concentrations are 1.0 mM 
^-cyciodextrin and 8.9 )lM maltose). Because RG1 3 has similar affin- 
ities for maltose and p-cyclodextrin but p-cyciodexbin Is present at 
a >1 00-fold higher concentration, the p-cyclodextrin preferentially 
replaces the maltose bound to RG13, and the rate of reaction de- 
creases to a level consistent with ^-cyclodextrin's modest effect 
on nitrocefin hydrolysis. 

(C) Reversible switching after dialysis. The inftial rate of nitrocefin 
hydrolysis at 25 |iM nitrocefin wi 
Maltose was added to a 



of maltose. In the absence of maltose, the catalytic con- 
stants were = 200 ± 40 s"' and K„ = 550 ± 120 
)iM. With the addition of saturating amounts of maltose, 
kut increased 3-fold and K„ decreased 8-fold, resulting 
in a 25-fold increase in /fct/K™. The kinetic constants of 
RG13 in the presence of saturating concentrations of 
maltose t)<^ = 620 ± 60 s"' and K„ = 68 ± 4 jjiM) were 
comparable to that previously reported for BLA at 24°C 
(fccat = 900 s ' and K„ = 110 ^M; [25D, indicating that 
RG1 3 is a very active TEM1 ^-lactamase in the presence 
of maltose. RG13 has exhibited switching behavior with 
all seven BLA substrates tested to date, including ampi- 
cillin (16-fold rate increase at 50 |xM ampicillin) and car- 
benicillin (1 2-fold rate increase at 50 p.M carbenicillin). 

The increase in kca indicates that maltose binding af- 
fects the catalytic steps. However, since K„ is a combi- 
nation of the rate constants for substrate binding as 



Figure 3. Characterization of Switches 

for maltose were determined in the ab- 
(whlte bars) and presence (black bars) of saturating concen- 
trations of carbenicillin. The apparent dissociation constants in the 
presence of subsaturatlng concentrations (25 (lM) of nitrocefin (gray 
bars) were also determined. Hie dissociation constants for maltose 
of MBP, MBP(I329W), and MBP(I329W/A96W) (dashed line) reported 
by Marvin and Hellinga [28] are shown for comparison. 
(B-D) Steady-state kinetic parameters of nitrocefin hydrolysis for 
RG1 3, RG1 3(|329W), and RG1 30329W/A96W) In the presence (black 
bars) or absence (white bars) of saturating concentrations of malt- 
ose. Experimental conditions were as follows: 1 00 mM sodium phos- 
phate buffer (pH 7.0), 2S°C. 



well as catalysis [26], K„ could not be directly used to 
ascertain the effect of maltose on substrate binding. 
Instead, the effect of maltose on substrate binding was 
detemriined indirectly by measuring the effect of sub- 
strate on maltose binding using intrinsic protein fluores- 
cence. These studies suggested that RG13 undergoes 
a conformational change much like MBP does upon 
maltodextrin binding, since maltose-induced quenching 
of total fluorescence (~10%) and shifting of the maxi- 
mum fluorescence wavelength (a 1.5 nm red-shift for 
maltose and a 4 nm blue-shift for p-cyclodextrin) were 
similar to that previously reported for MBP [27]. The 
presence of saturating amounts of the substrate carben- 
icillin decreased the dissociation constant of maltose 
and RG13 from 5.5 ± 0.5 iiM to 1.3 ± 0.5 m-M; thus, 
maltose binding must decrease the dissociation con- 
stant of carbenicillin and RG1 3 by the same factor (Sup- 
plemental Figure SI). This corresponds to a coupling 
energy of approximately 1 kcal/mol and offers an addi- 
tional explanation for the increase in 3-lactam hydrolysis 
in the presence of maltose: a positive heterotropic allo- 
steric effect on substrate binding. 

Presumably, the BLA domain of the apo, open fonri 
of RG1 3 exists in a compromised, less active conforma- 
tion. In the ligand-bound state, the BLA domain exists 
in a more normal, active conformation. But what is the 
state of the BLA domain in the process of closing? At 
what closure angle do the catalytic properties of RG13 
improve? To address these questions, we took advan- 
tage of mutations in the hinge region of MBP that manip- 
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■Conditions: DH5a-E celis on LB plates (with or without maltose) 

incubated at 37°C for 20 hr. 

'A mutant of Bl^ with reduced activity. 



ulate the conformational equilibria between the open 
and closed state [28]. Residual dipolar couplings have 
been used to establish that the apo forms of these mu- 
tants are partially closed relative to the apo wild-type 
MBP with the ensemble average closure angles being 
9.5° and 28.4° for I329W and I329W/A96W, respectively 
[29]. The llgand-bound closed forms of MBP, MBP 
(I329W) and MBP(I329W/A96W), have closure angles of 
35°. Partial closing shifts the equilibrium toward the li- 
gand-bound state, and thus the mutations Increase the 
affinity for maltose [28], 

Introduction of these mutations into RG13 resulted in 
the creation of more sensitive switches, switches that 
- respond to lower concentrations of maltose (Figures 3A 
and 38). The fact that we observed qualitatively similar 
changes in maltose affinity when the mutations are intro- 
duced into RG1 3 strongly suggests that the relative or- 
der and magnitude of the angles of closure of RG13, 
RQ1 3(I329W), and RG1 3(I329W/A96W) are similar to that 
of MBP, MBP0329W), and MBP(I329W/A96W). Thus, the 
apo forms of the two RG1 3 mutants offer confomiations 
intermediate between the open to the closed form of 
RG13, confomiations that may reflect that of RG13 In 
the process of closing. Assuming that the process of 
closing in RG13 passes through the confomiations of 
the apo forms of the two RG1 3 mutants, kinetic charac- 
terization of RG13(I329W) and RG13(I329W/A96W) sug- 
gested that the initial stages of closing do not result 
in changes in the BLA domain that substantially affect 
catalysis (Figures 3B-3D). Both k^t and K„ improved 
during the intermediate stages of closing, but the major- 
ity of the effect on K„ occurred during the final stages 
of closing. 

As the magnitude of the allosteric effect was on the 
same order as that of many natural allosteric enzymes, 
we next examined the biological effects of RG13. The 
switching activity was sufficient to result in an observ- 
able phenotype: maltose-dependent resistance to ampi- 
clllin (Table 1). E. coli cells expressing RG13 had amini- 
mum inhibitory concentration (MIC) for ampicillin that 
was 4-fold higher in the presence of 50 ^.M maltose. 
In contrast, the addition of the same concentration of 
sucrose or glucose to a plate did not affect the MIC. 
Thus, RG13 serves to couple the previously unrelated 
functions of ampicillin resistance and maltose concen- 
tration. E. coli cells expressing RG13 function as a 
growth/no growth sensor for maltose. 



Significance 

We have shown that two unrelated proteins can be 
systematically recomblned in order to link their re- 
specthre functions and create molecular switches. A 
combination of random circular permutation and ran- 
dom domain insertion enabled the creation of a MBP- 
BLA fusion geometry in which conformational changes 
induced upon maltose binding could propagate to the 
active site of BLA and increase BLA enzymatic activity 
up to 25-fold. The functional coupling of two proteins 
with no evolutionary or functional relationship is a 
powerful strategy for engineering novel molecular 
function. For example, coupling a ligand binding pro- 
tein and a protein with good signal transduction prop- 
erties would result in the creation of a molecular sen- 
sor for the ligand. Furthermore, switches that establish 
connections between cellular components with no 
previous relationship can result in novel cellular cir- 
cuitry and phenotypes. We envision, for example, that 
such switches might establish connections between 
molecular signatures of disease (e.g., abnormal con- 
centrations of proteins, metabolites, signaling, or 
other molecules) and functions that serve to treat the 
disease (e.g., delivery of drugs, modulation of signaling 
pathways, or modulation of gene expression) and 
therefore possess selective therapeutic properties. 



Supplemental Data including Experimental Procedures and a figure 
are available at http://www.chembiol.com/cgi/content/Tull/11/11/ 
■■■/DC1/. 
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